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RECEPTOR 

Field 

This invention relates to newly identified nucleic acids, polypeptides encoded by 
5 them and to their production and use. More particularly, the nucleic acids and polypeptides 
of the present invention relate to a G-protein coupled receptor (GPCR), hereinafter 
referred to as "Tarzan GPCR", and members of the purinoceptor family of GPCRs. The 
invention also relates to inhibiting or activating the action of such nucleic acids and 
polypeptides. 

10 Background 

It is well established that many medically significant biological processes are 
mediated by proteins participating in signal transduction pathways that involve G-proteins 
and/or second messengers, for example, cAMP (Lefkowitz, Nature^ 1991, 351: 353-354). 
These proteins are referred to as proteins participating in pathways with G-proteins or 

1 5 "PPG proteins". Some examples of these proteins include the GPC receptors, such as those 
for adrenergic agents and dopamine (Kobilka, B. K., et al., Proc. Natl Acad ScL, USA, 
1987, 84: 46-50; Kobilka B. K., et al., Science, 1987, 238: 650-656; Bunzow, J. R., et al., 
Nature, 1988, 336: 783-787), G-proteins themselves, effector proteins, for example, 
phospholipase C, adenyl cyclase, and phosphodiesterase, and actuator proteins, for 

20 example, protein kinase A and protein kinase C (Simon, M. I., et al.. Science, 1 99 1 , 252: 
802-8). 

For example, in one form of signal transduction, the effect of hormone binding is 
activation of the enzyme adenylate cyclase inside the cell. Enzyme activation by hormones 
is dependent on the presence of the nucleotide, GTP. GTP also influences hormone 
25 binding. A G-protein connects the hormone receptor to adenylate cyclase. G-protein is 
shown to exchange GTP for bound GDP when activated by a hormone receptor. The GTP 
canying form then binds to activated adenylate cyclase. Hydrolysis of GTP to GDP, 
catalysed by the G-protein itself, returns the G-protein to its basal, inactive form. Thus, the 
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G-protein serves a dual role, as an intermediate that relays the signal from receptor to 
effector, and as a clock that controls the duration of the signal. 

The membrane protein gene superfamily of G-protein coupled receptors (GPCRs) 
has been characterised as having seven putative transmembrane domains. The domains are 
5 believed to represent transmembrane a-helices connected by extracellular or cytoplasmic 
loops. G-protein coupled receptors include a wide range of biologically active receptors, 
such as hormone, viral, growth factor and neuroreceptors. 

G-protein coupled receptors (also known as 7TM receptors) have been 
characterised as including these seven conserved hydrophobic stretches of about 20 to 30 

10 amino acids, coimecting at least eight divergent hydrophilic loops. The G-protein family 
of coupled receptors includes dopamine receptors which bind to neuroleptic drugs used for 
treating psychotic and neurological disorders. Other examples of members of this family 
include, but are not limited to, calcitonin, adrenergic, endothelin, cAMP, adenosine, 
muscarinic, acetylcholine, serotonin, histamine, thrombin, kinin, follicle stimulating 

15 hormone, opsins, endothelial differentiation gene-1, rhodopsins, odorant, and 
cytomegalovirus receptors. 

Most G-protein coupled receptors have single conserved cysteine residues in each 
of the first two extracellular loops which form disulphide bonds that are believed to 
stabilise functional protein structure. The 7 transmembrane regions are designated as TMl, 
20 TM2, TM3, TM4, TM5, TM6, and TM7. tM3 has been implicated in signal transduction. 

Phosphorylation and lipidation (pamitylation or famesylation) of cysteine residues 
can influence signal transduction of some G-protein coupled receptors. Most G-protein 
coupled receptors contain potential phosphorylation sites within the third cytoplasmic loop 
and/or the carboxy terminus. For several G-protein coupled receptors, such as the p - 
25 adrenoreceptor, phosphorylation by protein kinase A and/or specific receptor kinases 
mediates receptor desensitization. For some receptors, the ligand binding sites of G- 
protein coupled receptors are believed to comprise hydrophilic sockets formed by several 
G-protein coupled receptor transmembrane domains, the sockets being surrounded by 
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hydrophobic residues of the G-protein coupled receptors. The hydrophilic side of each G- 
protein coupled receptor transmembrane helix is thought to face inward and form a pol2ir 
ligand binding site. TM3 has been implicated in several G-protein coupled receptors as 
having a ligand binding site, such as the TM3 aspartate residue. TM5 serines, a TM6 
5 asparagine and TM6 or TM7 phenylalanines or tyrosines are also implicated in ligand 
binding. 

G-protein coupled receptors can be intracellularly coupled by heterotrimeric G- 
proteins to various intracellular enzymes, ion channels and transporters (see, Johnson et 
al., Endoc. Rev., 1989, 10: 317-331). Different G-protein a-subunits preferentially 

1 0 stimulate particular effectors to modulate various biological functions in a cell. 
Phosphorylation of cytoplasmic residues of G-protein coupled receptors has been 
identified as an important mechanism for the regulation of G-protein coupling of some G- 
protein coupled receptors. G-protein coupled receptors are found in numerous sites within 
a mammalian host. Over the past 1 5 years, nearly 350 therapeutic agents targeting 7 

1 5 transmembrane (7 TM) receptors have been successfully introduced onto the market. 

Thus, G-protein coupled receptors have an established, proven history as 
therapeutic targets. Clearly there is a need for identification and characterization of further 
receptors which can play a role in preventing, ameliorating or correcting dysfimctions or 
diseases, including, but not limited to, infections such as bacterial, fungal, protozoan and 

20 viral infections, particularly infections caused by HIV-l or HIV-2; pain; cancers; diabetes, 
obesity; anorexia; bulimia; asthma; parkinson's disease; thrombosis; acute heart failure; 
hypotension; hypertension; erectile dysfunction; urinary retention; metabolic bone 
diseases such as osteoporisis and osteo petrosis; angina pectoris; myocardial infarction; 
ulcers; asthma; allergies; rheumatoid arthritis; inflammatory bowel disease; irritable bowel 

25 syndrome benign prostatic hypertrophy; and psychotic and neurological disorders, 

including anxiety, schizophrenia, manic depression, delirium, dementia, severe mental 
retardation and dyskinesias, such as Huntington's disease or Gilles dela Tourett's 
syndrome. 
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Summary of the Invention 

According to a first aspect of the present invention, we provide a Tarzan GPCR 
polypeptide comprising the amino acid sequence shown in SEQ ID NO: 3, SEQ ID NO: 5, 
SEQ ID NO: 6 or SEQ ID NO: 8, or a homologue, variant or derivative thereof. 

5 There is provided, according to a second aspect of the present invention, a nucleic 

acid capable of encoding a polypeptide according to the first aspect of the invention. 
Preferably, the nucleic acid comprises the nucleic acid sequence shown in SEQ ID NO: 1, 
SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 7 or SEQ ID NO: 10, or a homologue, 
variant or derivative thereof. 

10 We provide, according to a third aspect of the present invention, a polypeptide 

comprising a fragment of a polypeptide according to the first aspect of the invention. 

Preferably, such a Augment containing polypeptide comprises one or more regions 
which are homologous between any two or more of SEQ ID NO: 3, SEQ ID NO: 5, SEQ 
ID NO: 6 and SEQ ID NO: 8, or which comprises one or more regions which are 
1 5 heterologous between any two or more of SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 6 
and SEQ ID NO: 8. 

As a fourth aspect of the present invention, there is provided a nucleic acid capable 
of encoding a polypeptide according to the third aspect of the invention. 

We provide, according to a fifth aspect of the present invention, a vector 
20 comprising a nucleic acid according to the second or fourth aspect of the invention. 

The present invention, in a sixth aspect, provides a host cell comprising a nucleic 
acid according to the second or fourth aspect of die invention, or vector according to the 
fifth aspect of the invention. 
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In a seventh aspect of the present invention, there is provided a transgenic non- 
human animal comprising a nucleic acid according to the second or fourth aspect of the 
invention or a vector accordmg to the fifth aspect of the invention. Preferably, the 
transgenic non-human animal is a mouse. 

5 According to an eighth aspect of the present invention, we provide use of a 

polypeptide according to the first or third aspect of the invention in a method of 
identifying compound which is capable of interacting specifically with a G protein 
coupled receptor. 

We provide, according to a ninth aspect of the invention, use of a transgenic non- 
10 human animal according to the seventh aspect of the invention in a method of identifying a 
compound which is capable of interacting specifically with a G protein coupled receptor. 

There is provided, in accordance with a tenth aspect of the present invention, a 
method for identifying an antagonist of a Tarzan GPCR, the method comprising contacting 
a cell which expresses Tarzan receptor with a candidate compound and determining 
1 5 whether the level of cyclic AMP (cAMP) in said cell is lowered as a result of said 
contacting. 

As an eleventh aspect of the invention, we provide a method for identifying a 
compound capable of lowering the endogenous level of cyclic AMP in a cell which 
method comprises contacting a cell which expresses a Tarzan GPCR with a candidate 
20 compound and determining whether the level of cyclic AMP (cAMP) in said cell is 
lowered as a result of said contacting. 

According to a twelfth aspect of the invention, we provide a method for identifying 
a compound capable of binding to a Tarzan GPCR polypeptide, the method comprising 
contacting a Tarzan GPCR polypeptide with a candidate compound and detennimng 
25 whether the candidate compound binds to the Tarzan GPCR polypeptide. 
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We provide, according to a thirteenth aspect of the invention, there is provided a 
compound identified by a method according to any of the eighth to twelfth aspects of the 
invention. 

According to a fourteenth aspect of the present invention, we provide a compound 
5 capable of binding specifically to a polypeptide according to the first or third aspect of the 
invention. 

There is provided, according to a fifteenth aspect of the present invention, use of a 
polypeptide according to the first or third aspect of the invention, or part thereof; or a 
nucleic acid according to the second or fourth aspect of the invention, or part thereof, in a 
1 0 method for producing antibodies. 

We provide, accordmg to a sixteenth aspect of the present invention, an antibody 
capable of binding specifically to a polypeptide according to the fij^t or third aspect of the 
invention, or part thereof; or a polypeptide encoded by a nucleic acid according to the 
second or fourth aspect of the invention, or part thereof;. 

15 As a seventeenth aspect of the present invention, there is provided a 

pharmaceutical composition comprising any one or more of the following: a polypeptide 
according to the first or third aspect of the invention, or part thereof; a polypeptide 
encoded by a nucleic acid according to the second or fourth aspect of the invention, or part 
thereof; a vector according to the fifth aspect of the invention; a cell according to the sixth 

20 aspect of the invention; a compound according to the thirteenth or fourteenth aspect of the 
invention; and an antibody according to the sixteenth aspect of the invention, together with 
a pharmaceutically acceptable carrier or diluent. 



We provide, according to a eighteenth aspect of the present invention, a vaccine 
composition comprising any one or more of the following: a polypeptide according to the 
25 first or third aspect of the invention, or part thereof; a polypeptide encoded by a nucleic 
acid according to the second or fourth aspect of the invention, or part thereof; a vector 
according to the fifth aspect of the invention; a cell according to the sixth aspect of the 
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invention; a compound according to the thirteenth or fourteenth aspect of the invention; 
and an antibody according to the sixteenth aspect of the invention. 

According to an nineteenth aspect of the present invention, we provide a diagnostic 
kit for a disease or susceptibility to a disease comprising any one or more of the following: 
5 a polypeptide according to the first or third aspect of the invention, or part thereof; a 
polypeptide encoded by a nucleic acid according to the second or fourth aspect of the 
invention, or part thereof; a vector according to the fifth aspect of the invention; a cell 
according to the sixth aspect of the invention; a compound according to the thirteenth or 
fourteenth aspect of the invention; and an antibody according to the sixteenth aspect of the 
10 invention. 

We provide, according to a twentieth aspect of the invention, a method of treating 
a patient suffering firom a disease associated with enhanced activity of a Taizan GPCR, 
which method comprises administering to the patient an antagonist of Tarzan GPCR. 

There is provided, in accordance with a twenty-first aspect of the present 
1 5 invention, a method of treating a patient suffering firom a disease associated with reduced 
activity of a Tarzan GPCR, which method comprises administering to the patient an 
agonist of Tarzan GPCR. 

Preferably, the Tarzan GPCR comprises a polypeptide having the sequence shown 
in SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8. 

20 According to a twenty-second aspect of the present invention, we provide a method 

for treating and/or preventing a disease in a patient, which comprises the step of 
administering any one or more of the following to the patient: a polypeptide according to 
the first or third aspect df the invention, or part thereof; a polypeptide encoded by a 
nucleic acid according to the second or fourth aspect of the mvention, or part thereof; a 

25 vector according to the fifth aspect of the invention; a cell according to the sixth aspect of 
the invention; a compound according to the thirteenth or fourteenth aspect of the 
invention; and an antibody according to the sixteenth aspect of the invention; a 
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pharmaceutical composition according to the seventeenth aspect of the invention; and a 
vaccine according to the eighteenth aspect of the invention, to the subject 

There is provided, according to a twenty-third aspect of the present invention, an 
agent comprising a polypeptide according to the first or third aspect of the invention, or 
5 part thereof; a polypeptide encoded by a nucleic acid according to the second or fourth 
aspect of the invention, or part thereof; a vector according to the fifth aspect of the 
invention; a cell according to the sixth aspect of the invention; a compoimd according to 
the thirteenth or fourteenth aspect of the invention; and an antibody according to the 
sixteenth aspect of the invention, said agent for use in a method of treatment or 
10 prophylaxis of disease. 

We provide, according to a twenty-fourth aspect of the present invention, use of a 
polypeptide according to the first or third aspect of the invention, or part thereof; a 
polypeptide encoded by a nucleic acid according to the second or fourth aspect of the 
invention, or part thereof; a vector according to the fifth aspect of the invention; a cell 
1 5 according to the sixth aspect of the invention; a compoimd according to the thirteenth or 
fourteenth aspect of the invention; and an antibody according to the sixteenth aspect of the 
invention, for the preparation of a pharmaceutical composition for the treatment or 
prophylaxis of a disease. 

As a twenty-fifth aspect of the present invention, there is provided non-himian 
20 transgenic animal, characterized in that the transgenic animal comprises an altered Tarzan 
gene. Preferably, the alteration is selected from the group consisting of: a deletion of 
Tarzan, a mutation in Tarzan resulting in loss of function, introduction of an exogenous 
gene having a nucleotide sequence with targeted or random mutations into Tarzan, 
introduction of an exogenous gene from another species into Tarzan, and a combination of 
25 any of these. 

We provide, according to a twenty-sixth aspect of the present invention, a non- 
human transgenic animal having a functionally disrupted endogenous Tarzan gene, in 
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which the transgenic animal comprises in its genome and expresses a transgene encoding a 
heterologous Tarzan protein. 

The present invention, in a twenty-seventh aspect, provides a nucleic acid construct 
for functionally disrupting a Tarzan gene in a host cell, the nucleic acid construct 
5 comprising: (a) a non-homologous replacement portion; (b) a first homology region 
located upstream of the non-homologous replacement portion, the first homology region 
having a nucleotide sequence with substantial identity to a first Tarzan gene sequence; and 
(c) a second homology region located downstream of the non-homologous replacement 
portion, the second homology region having a nucleotide sequence v^th substantial 
1 0 identity to a second Tarzan gene sequence, the second Tarzan gene sequence having a 
location downstream of the first Tarzan gene sequence in a naturally occurring 
endogenous Tarzan gene. 

According to a twenty-eighth aspect of the present invention, we provide a process 
for producing a Tarzan GPCR polypeptide, the method comprising culturing a host cell 
1 5 according to the sixth aspect of the invention under conditions in which a nucleic acid 
encoding a Tarzan GPCR polypeptide is expressed. 

There is provided, according to a twenty-ninth aspect of the present invention, a 
method of detecting the presence of a nucleic acid according to the second or fourth aspect 
of the invention in a sample, the method comprising contacting the sample with at least 
20 one nucleic acid probe which is specific for said nucleic acid and monitoring said sample 
for the presence of the nucleic acid. 

We provide, according to a thirtieth aspect of the present invention, a method of 
detecting the presence of a polypeptide according to the first or third aspect of the 
invention in a sample, the method comprising contacting the sample v^th an antibody 
25 according to the sixteenth aspect of the invention and monitoring said sample for the 
presence of the polypeptide. 
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As a thirty-first aspect of the present invention, there is provided a method of 
diagnosis of a disease or syndrome caused by or associated with increased, decreased or 
otherwise abnormal expression of Tarzan GPCR, the method comprising the steps of: (a) 
detecting the level or pattern of expression of Tarzan GPCR in an animal suffering or 
5 suspected to be suffering from such a disease; and (b) comparing the level or pattern of 
expression with that of a normal animal 

Brief Description of the Drawings 

Figure 1 is a diagram showing the results of analysis of the human Tarzan 
polypeptide (SEQ ID NO: 3) using the HMM structural prediction software of pfam 
1 0 (http://www.sanger.ac.uk/Software/Pfam/search.shtml). 

Figure 2 is a diagram showing an expression profile for human Tarzan GPCR 
generated by reverse transcription-polymerase chain reaction (RT-PCR). 

Figure 3 is a diagram showing the structure of the mouse Tarzan locus before 
knock-out. 

1 5 Figure 4 is a diagram showing the structure of the mouse Tarzan locus after knock- 

out. 

Figure 5 is a diagram showing the structure of the vector used for mouse Tarzan 
knock-out, including cloning sites. 

Sequence Listings 

20 SEQ ID NO: 1 shows the cDNA sequence of human Tarzan. SEQ ID NO: 2 

shows an open reading fi:ame derived fcom SEQ ID NO: 1 . SEQ ID NO: 3 shows the 
amino acid sequence of human Tarzan. SEQ ID NO: 4 shows the open reading firame of a * 
cDNA for mouse Tarzan. SEQ ID NO: 5 shows the amino acid sequence of mouse 
Tarzan. 
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SEQ ID NO: 6 shows the amino acid sequence of a fusion protein of TARZAN 
with V5 and His tags at c-terminal end. SEQ ID NO: 7 shows a nucleotide sequence 
comprising Tarzan with Nhel and Xhol at the 5-prime and 3-prime ends, generated by 
PCR. SEQ ID NO: 8 shows the amino acid sequence of Tarzan fusion protein comprising 
5 a C terminal FLAG tag. SEQ ID NO: 10 shows the nucleotide sequence of a Tarzan 
genomic locus from 5*prF to 3*prR, including translated coding amino acid sequence. 

Where two sequences are set out under a particular sequence identifier (i.e., SEQ 
ID NO: NN), both sequences should understood to be encompassed by reference to that 
particular SEQ ID NO:. 



10 Detailed Description 
Tarzan GPCR 



Our invention relates in general to a novel G-Protein Coupled Receptor (GPCR), in 
particular, an orphan purinoceptor type G-protein coupled receptor, which we refer to as 
Tarzan GPCR, as well as homologues, variants or derivatives thereof. 

1 5 Tarzan is structurally related to other proteins of the G-protein coupled receptor ' 

family, as shown by the results of sequencing the amplified cDNA products encoding 
human Tarzan. The cDNA sequence of SEQ ID NO: 1 contains an open reading flame 
(SEQ ID NO: 2, nucleotide numbers 1 1 to 1022) encoding a polypeptide of 337 amino 
acids shown in SEQ ID NO: 3. Human Tarzan is found to map to hoino sapiens 

20 chromosome 13q32.2. 



Identities and Similarities to Tarzan 

The amino acid sequence of Tarzan has about 36 % identity and 58 % similarity 
(using BLAST) in 299 amino acid residues with human p2y purinoceptor 1 (p2y 1) 
(accession # CAA07339.1, Leon, C, Vial, C, Weber, J., Cazenave, J.P. and Gacher, C. 
25 direct Submission (15-JUN-1998) related accession numbers: NP_002554.1, P47900, 
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JC4737, AAA97872.1, AAA97873.1. AAB47091.1). The nucleotide sequence of Tarzan 
(SEQ ID NO: 1) has about 100% identity (using BLAST) in 178 nucleotide residues with 
the anonymous homo sapiens EST noraiai thyroid epithelium cdna from accession # 
AW827323 Robert Strausberg, NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap, May 18 
5 2000. 



Analysis of the Tarzan polypeptide (SEQ ID NO: 3) using the HMM structural 
prediction software of pfam fhttp://\v\vw,sanger,ac.uk/Software/Pfam/search.shtmn 
confirms that Tarzan peptide is a GPCR of the 7TM-1 structural class (see Figure 1). 

Human and mouse Tarzan GPCR are therefore members of a large family of G 
1 0 Protein Coupled Receptors (GPCRs). 

Expression Profile OF Tarzan 

Polymerase chain reaction (PCR) amplification of Tarzan cDNA detects 
expression of Tarzan to varying abundance in human spleen, heart, brain, kidney, lung, 
small intestine, prostate, ovary, muscle, and testis. An expression profile of Tarzan GPCR 

1 5 is shown in Figure 2. Using Tarzan cDNA of SEQ ID NO: 1 to search the human EST data 
sources by BLASTN, identities are found in cDNA derived from libraries originating from 
normal thyroid epithelium (accession # AW827323). This indicates that Tarzan is 
expressed in these normal or abnormal tissues. Accordingly, the Tarzan polypeptides, 
nucleic acids, probes, antibodies, expression vectors and ligands are useful for detection, 

20 diagnosis, treatment and other assays for diseases associated with over-, under- and 
abnormal expression of Tarzan GPCR in these and other tissues. 



Furthermore, the Tarzan polypeptides, nucleic acids, probes, antibodies, expression 
vectors and ligands are usefiil as (and for the production of) biosensors. According to 
Aizawa (1 988), AnaL Chem, Symp, 17: 683, a biosensor is defined as being a unique 
25 combination of a receptor for molecular recognition, for example a selective layer with 
inmiobilized antibodies or receptors such as a Tarzan G-protein coupled receptor, and a 
transducer for transmitting the values measured. One group of such biosensors will detect 
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the change which is caused in the optical properties of a surface layer due to the 
interaction of the receptor with the surrounding medium. Among such techniques may be 
mentioned especially ellipso-metry and surface plasmon resonance. Biosensors 
incorporating Tarzan may be used to detect the presence or level of Tarzan ligands, for 
5 example, nucleotides such as purines or purine analogues, or analogues of these ligands. 
The construction of such biosensors is well known in the art. Thus, cell lines expressing 
Tarzan receptor may be used as reporter systems for detection of ligands such as ATP via 
receptor-promoted formation of [3H]inositol phosphates or other second messengers (Watt 
et al., 1998, y Bia/ Chem May 29;273(22): 14053-8). Receptor-ligand biosensors are also 

10 described in Hofi&nan et al., 2000, Proc Natl Acad Sci USA Oct 10;97(21):1 1215-20. 
Optical and other biosensors comprising Tarzan may also be used to detect the level or 
presence of interaction with G-proteins and other proteins, as described by, for example, 
Figler et al, 1997, Biochemistry Dec 23;36(51): 16288-99 and Sarrio et al., 2000, Mol Cell 
Biol 2000 Jul;20(14):5 164-74). Sensor units for biosensors are described in, for example, 

15 US 5,492,840. 

This and other embodiments of the invention will be described in further detail 

below. 

Methods Employed 

The. practice of the present invention will employ, unless otherwise indicated, 
20 conventional techniques of chemistry, molecular biology, microbiology, recombinant 
DNA and inunxmology, which are within the capabilities of a person of ordinary skill in 
the art. Such techniques are explained in the literature. See, for example, J. Sambrook, E. 
F. Fritsch, and T. Maniatis, 1989, Molecular Cloning: A Laboratory Manual, Second 
Edition, Books 1-3, Cold Spring Harbor Laboratory Press; Ausubel, F. M. et al, (1995 and 
25 periodic supplements; Current Protocols in Molecular Biology, ch. 9, 13, and 16, John 
Wiley & Sons, New York, N.Y.); B. Roe, J. Crabtree, and A. Kahn, 1996. DNA Isolation 
and Sequencing: Essential Techniques, John Wiley & Sons; J. M. Polak and James 0*D. 
McGee, 1990, In Situ Hybridization: Principles and Practice; Oxford University Press; M. 
J. Gait (Editor), 1984, Oligonucleotide Synthesis: A Practical Approach, Irl Press; and, D. 



wo 02/088183 PCT/GB02/01923 

14 

M. J. Lilley and J. E. Dahlberg, 1992, Methods of Enzymology: DNA Structure Part A: 
Synthesis and Physical Analysis of DNA Methods in Enzymology, Academic Press. Each 
of these general texts is herein incorporated by referefnce. 

Tarzan GPCR Polypeptides 

5 As used here, the term "Tarzan GPCR polypeptide" is intended to refer to a 

polypeptide comprising the amino acid sequence shown in SEQ ID NO: 3, SEQ ID NO: 5, 
SEQ ID NO: 6 or SEQ ID NO: 8, or a homologue, variant or derivative thereof 
Preferably, the polypeptide comprises or is a homologue, variant or derivative of the 
sequence shown in SEQ ID NO: 3. 

10 "Polypeptide" refers to any peptide or protein comprising two or more amino acids 

joined to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres. 
"Polypeptide" refers to both short chains, commonly referred to as peptides, oligopeptides 
or oligomers, and to longer chains, generally referred to as proteins. Polypeptides may 
contain amino acids other than the 20 gene-encoded amino acids. 

15 "Polypeptides" include amino acid sequences modified either by natural processes, 

such as post-translational processing, or by chemical modification techniques which are 
well known in the art. Such modifications are well described in basic texts and in more 
detailed monographs, as well as in a voluminous research literature. Modifications can 
occur anjrwhere in a polypeptide, including the peptide backbone, the amino acid side- 

20 chains and the amino or carboxyl termini. It will be appreciated that the same type of 
modification may be present in the same or varying degrees at several sites in a given 
polypeptide. Also, a given polypeptide may contain many types of modifications. 

Polypeptides may be branched as a result of ubiquitination, and they may be cyclic, 
with or without branching. Cyclic, branched and branched cyclic polypeptides may result 
25 fix>m posttranslation natural processes or may be made by synthetic methods. 

Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent 
attachment of flavin, covalent attachment of aheme moiety, covalent attachment of a 
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nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, 
covalent attachment of phosphotidylinositol, cross-inking, cyclization, disulfide bond 
formation, demethylation, formation of covalent cross-inks, formation of cystine, 
formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI 
5 anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation, 
proteolytic processing, phosphorylation, prenylation, racemization, selenoylation, 
sulfation, transfer-RNA mediated addition of amino acids to proteins such as arginylation, 
and ubiquitination. See, for instance, Proteins - Structure and Molecular Properties^ 2nd 
Ed., T. E. Creighton, W. H. Freeman and Company, New York, 1993 and Wold, F., 
10 Posttranslational Protein Modifications: Perspectives and Prospects^ pgs. 1-12 in 

Posttranslational Covalent Modification of Proteins^ B. C. Johnson, Ed., Academic Press, 
New York, 1983; Seifter et al., "Analysis for protein modifications and nonprotein 
cofactors", Meth Enzymol (1990) 182:626-646 and Rattan et aL, "Protein Synthesis: 
Posttranslational Modifications and Aging", Ann NYAcadSci (1992) 663:48-62. 

1 5 The terms 'Variant" "homologue", "derivative" or "fii^gment" in relation to the 

present invention include any substitution of, variation of, modification of, replacement of, 
deletion of or addition of one (or more) amino acid firom or to a sequence. Unless the 
context admits otherwise, references to "Tarzan" and "Tarzan GPCR" include references 
to such variants, homologues, derivatives and firagments of Tarzan. 

20 Preferably, as applied to Tarzan, the resultant amino acid sequence has GPCR 

activity, more preferably having at least the same activity of the Tarzan GPCR shown as 
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8. In particular, the term 
"homologue" covers identity with respect to structure and/or fimction providing the 
resultant amino acid sequence has GPCR activity. With respect to sequence identity (i.e. 

25 similarity), preferably there is at least 70%, more preferably at least 75%, more preferably 
at least 85%, even more preferably at least 90% sequence identity. More preferably there 
is at least 95%, more preferably at least 98%, sequence identity. These terms also 
encompass polypeptides derived firom amino acids which are allelic variations of the 
Tarzan GPCR nucleic acid sequence. 
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Where reference is made to the '^receptor activity" or "biological activity" of a 
receptor such as Tarzan GPCR, these tenns are intended to refer to the metabolic or 
physiological function of the Tarzan receptor, including similar activities or improved 
activities or these activities with decreased undesirable side effects. Also included are 
5 antigenic and immimogenic activities of the Tarzan receptor. Examples of GPCR activity, 
and methods of assaying and quantifying these activities, are known in the art, and are 
described in detail elsewhere in this document. 

As used herein a "deletion" is defined as a change in either nucleotide or amino 
acid sequence in which one or more nucleotides or amino acid residues, respectively, are 
1 0 absent. As used herein an "insertion" or "addition" is that change in a nucleotide or amino 
acid sequence which has resulted in the addition of one or more nucleotides or amino acid 
residues, respectively, as compared to the naturally occurring substance. As used herein 
"substitution" results from the replacement of one or more nucleotides or amino acids by 
different nucleotides or amino acids, respectively. 

1 5 Tarzan polypeptides according to the present invention may also have deletions, 

insertions or substitutions of amino acid residues which produce a silent change and result 
in a fimctionally equivalent amino acid sequence. Deliberate amino acid substitutions may 
be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, 
hydrophilicity, and/or the amphipathic nature of the residues. For example, negatively 

20 charged amino acids include aspartic acid and glutamic acid; positively charged amino 
acids include lysine and arginine; and amino acids with uncharged polar head groups 
having similar hydrophilicity values include leucine, isoleucine, valine, glycine, alanine, 
asparagine, glutamine, serine, threonine, phenylalanine, and tyrosine. 

Conservative substitutions may be made, for example according to the table below. 
25 Amino acids in the same block in the second column and preferably in the same line in the 
third colimm may be substituted for each other: 



ALIPHATIC 


Non-polar 


GAP 






ILV 




Polar - uncharged 


CSTM 






NQ 1 
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Polar - charged 


DE 


KR 


AROMATIC 




HFWY 



Tarzan polypeptides of the invention may further comprise heterologoxis amino 
acid sequences, typically at the N-terminus or C-terminus, preferably the N-terminus. 
Heterologous sequences may include sequences that affect intra or extracellular protein 
targeting (such as leader sequences). Heterologous sequences may also include sequences . 
5 that increase the immunogenicity of the polypeptide of the invention and/or which 
facilitate identification, extraction and/or purification of the polypeptides. Another 
heterologous sequence that is particularly preferred is a polyamino acid sequence such as 
polyhistidine which is preferably N-terminal. A polyhistidine sequence of at least 10 
amino acids, preferably at least 17 amino acids but fewer than 50 amino acids is especially 
10 preferred. 

The Tarzan GPCR polypeptides may be in the form of the "mature" protein or may 
be a part of a larger protein such as a fusion protein. It is often advantageous to include an 
additional amino acid sequence which contains secretory or leader sequences, pro- 
sequences, sequences which aid in purification such as multiple histidine residues, or an 
1 5 additional sequence for stability during recombinant production. 

Tarzan polypeptides of the invention are advantageously made by recombinant 
means, using known techniques. However they may also be made by synthetic means 
using techniques well known to skilled persons such as solid phase synthesis. Polypeptides 
of the invention may also be produced as fusion proteins, for example to aid in extraction 

20 and purification. Examples of fusion protein partners include glutathione-S-transferase 
(GST), 6xHis, GAL4 (DNA binding and/or transcriptional activation domains) and p- 
galactosidase. It may also be convenient to include a proteolytic cleavage site between the 
fusion protein partner and the protein sequence of interest to allow removal of fusion 
protein sequences, such as a thrombin cleavage site. Preferably the fusion protein will not 

25 hinder the function of the protem of interest sequence. 
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Tarzan polypeptides of the invention may be in a substantially isolated form. This 
term is intended to refer to alteration by the hand of man from the natural state. If an 
"isolated" composition or substance occurs in nature, it has been changed or removed 
from its original environment, or both. For example, a polynucleotide, nucleic acid or a 
5 polypeptide naturally present in a living animal is not "isolated," but the same 

polynucleotide, nucleic acid or polypeptide separated from the coexisting materials of its 
natural state is "isolated", as the term is employed herein. 

It will however be understood that the Tarzan GPCR protein may be mixed with 
carriers or diluents which will not interfere with the intended purpose of the protein and 
10 still be regarded as substantially isolated. A polypeptide of the invention may also be in a 
substantially purified form, in which case it will generally comprise the protein in a 
preparation in which more than 90%, for example, 95%, 98% or 99% of the protein in the 
preparation is a Tarzan GPCR polypeptide of the invention. 

The present invention also relates to peptides comprising a portion of a Tarzan 
1 5 polypeptide according to the invention. Thus, fragments of Tarzan GPCR and its 

homologues, variants or derivatives are included. The peptides of the present invention 
may be between 2 and 200 amino acids, preferably between 4 and 40 amino acids in 
length. The peptide may be derived from a Tarzan GPCR polypeptide as disclosed here, 
for example by digestion with a suitable enzyme, such as trypsin. Alternatively the 
20 peptide, fragment, etc may be made by recombinant means, or synthesised synthetically. 

The term "peptide" includes the various synthetic peptide variations known in the 
art, such as a retroinverso D peptides. The peptide may be an antigenic determinant and/or 
a T-cell epitope. The peptide may be immunogenic in vivo. Preferably the peptide is 
capable of inducing neutralising antibodies in vivo. 

25 By aligning Tarzan GPCR sequences from different species, it is possible to 

determine which regions of the amiiio acid sequence are conserved between different 
species ("homologous regions"), and which regions vary between the different species 
("heterologous regions'^- 
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The Tarzan polypeptides according to the invention may therefore comprise a 
sequence which corresponds to at least part of a homologous region. A homologous region 
shows a high degree of homology between at least two species. For example, the 
homologous region may show at least 70%, preferably at least 80%, more preferably at 
5 least 90%, even more preferably at least 95% identity at the amino acid level using the 
tests described above. Peptides which comprise a sequence which corresponds to a 
homologous region may be used in therapeutic strategies as explained in further detail 
below. Alternatively, the Tarzan GPCR peptide may comprise a sequence which 
corresponds to at least part of a heterologous region. A heterologous region shows a low 
1 0 degree of homology between at least two species. 

Tarzan GPCR Polynucleotides and Nucleic Acids 

This invention encompasses Tarzan polynucleotides, Tarzan nucleotides and 
Tarzan nucleic acids, methods of production, uses of these, etc, as described in further 
detail elsewhere in this document. 

15 The terms "Tarzan polynucleotide", "Tarzan nucleotide" and "Tarzan nucleic acid" 

may be used interchangeably, and are intended to refer to a polynucleotide/nucleic acid 
comprising a nucleic acid sequence as shown in SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 7 or SEQ ID NO: 10, or a homologue, variant or derivative thereof. 
Preferably, the polynucleotide/nucleic acid comprises or is a homologue, variant or 

20 derivative of the nucleic acid sequence SEQ ID NO: 1 or SEQ ID NO: 2, most preferably, 
SEQ ID NO: 2. 

These terms are also intended to include a nucleic acid sequence capable of 
encoding a polypeptides and/or a peptide of the present invention, i.e., a Tarzan 
polypeptide. Thus, Tarzan GPCR polynucleotides and nucleic acids comprise a nucleotide 
2S sequence capable of encoding a polypeptide comprising the amino acid sequence shown in 
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8, or a homologue, variant 
or derivative thereof Preferably, the Tarzan GPCR polynucleotides and nucleic acids 
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comprise a nucleotide sequence capable of encoding a polypeptide comprising the amino 
acid sequence shown in SEQ ID NO: 3, or a homologue, variant or derivative thereof. 

"Polynucleotide" generally refers to any polyribonucleotide or 
polydeoxribonucleotide, which may be unmodified RNA or DNA or modified RNA or 
5 DNA. "Polynucleotides" include, without limitation single- and double-stranded DNA, 
DNA that is a mixture of single- and double-stranded regions, single- and double-stranded 
RNA, and RNA that is mixture of single- and double-stranded regions, hybrid molecules 
comprising DNA and RNA that may be single-stranded or, more typically, double- 
stranded or a mixture of single- and double-stranded regions. In addition, "polynucleotide" 

1 0 refers to triple-stranded regions comprising RNA or DNA or both RNA and DNA. The 
term polynucleotide also includes DNAs or RNAs containing one or more modified bases 
and DNAs or RNAs with backbones modified for stability or for other reasons. 
"Modified" bases include, for example, tritylated bases and unusual bases such as inosine. 
A variety of modifications has been made to DNA and RNA; thus, "polynucleotide" 

15 embraces chemically, enzymatically or metabolically modified forms of polynucleotides 
as typically found in nature, as well as the chemical forms of DNA and RNA characteristic 
of viruses and cells. "Polynucleotide" also embraces relatively short polynucleotides, often 
referred to as oligonucleotides. 

It will be understood by the skilled person that numerous nucleotide sequences can 
20 encode the same polypeptide as a result of the degeneracy of the genetic code. 

As used herein, the term "nucleotide sequence" refers to nucleotide sequences, 
oligonucleotide sequences, polynucleotide sequences and variants, homologues, fiagments 
and derivatives thereof (such as portions thereof). The nucleotide sequence may be DNA or 
RNA of genomic or synthetic or recombinant origin which may be double-stranded or 
2S single-stranded whether representing the sense or antisense strand or combinations thereof. 
The term nucleotide sequence may be prepared by use of recombinant DNA techniques 
(for example, recombinant DNA). 



Preferably, the term "nucleotide sequence" means DNA. 
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The terms 'Variant", "homologue", "derivative" or "fragment" in relation to the 
present invention include any substitution of, variation of, modification of, replacement of, 
deletion of or addition of one (or more) nucleic acids fi-om or to the sequence of a Tarzan 
nucleotide sequence. Unless the context admits otherwise, references to "Tarzan" and 
5 "Tarzan GPCR" include references to such variants, homologues, derivatives and 
firagments of Tarzan. 

Preferably, the resultant nucleotide sequence encodes a polypeptide having GPCR 
activity, preferably having at least the same activity of the GPCR shown as SEQ ID NO: 
3, SEQ ID NO: 5, SEQ K) NO: 6 or SEQ ID NO: 8. Preferably, the term 'Tiomologue" is 

10 intended to cover identity with respect to structure and/or function such that the resultant 
nucleotide sequence encodes a polypeptide which has GPCR activity. With respect to 
sequence identity (i.e. similarity), preferably there is at least 70%, more preferably at least 
75%, more preferably at least 85%, more preferably at least 90% sequence identity. More 
preferably there is at least 95%, more preferably at least 98%, sequence identity. These 

1 5 terms also encompass allelic variations of the sequences. 

Tarzan Control Regions 

For some purposes, it may be necessary to utilise or investigate control regions of 
Tarzan. Such control regions include promoters, enhancers and locus control regions. By a 
control region we mean a nucleic acid sequence or structure which is capable of 
20 modulating the expression of a coding sequence which is operatively linked to it. 

For example, control regions are useful in generating transgenic animals 
expressing Tarzan. Furthermore, control regions may be used to generate expression 
constructs for Tarzan. Control regions from different individuals in a population may be 
sequenced to identify non-coding polymorphisms, which may affect the expression level 
25 of Tarzan. This is described in further detail below. 

Identification of control regions of Tarzan is straightforward, and may be carried 
out in a number of ways. For example, the coding sequence of Tarzan may be obtained 



wo 02/088183 



PCT/GB02/01923 



22 

from an organism, by screening a cDNA library using a human or mouse Tarzan cDNA 
sequence as a probe. 5' sequences may be obtained by screening an appropriate genomic 
library, or by primer extension as known in the art. Database searching of genome 
databases may also be employed. Such 5' sequences which are particularly of interest 
5 include non-coding regions. The 5* regions may be examined by eye, or with the aid of 
computer programs, to identify sequence motifs which indicate the presence of promoter 
and/or enhancer regions. 

Furthermore, sequence alignments may be conducted of Tarzan nucleic acid 
sequences from two or more organisms. By aligning Tarzan GPCR sequences from 

1 0 different species, it is possible to determine which regions of the nucleotide sequence are 
conserved between different species. Such conserved regions are likely to contain control 
regions for the gene in question (i.e., Tarzan). The mouse and human genomic sequences 
as disclosed here, for example, a mouse Tarzan genomic sequence (SEQ ID NO: 10), may 
be employed for this purpose. Furthermore, Tarzan hbmologues from other organisms may 

IS be obtained using standard methods of screening using appropriate probes generated from 
the mouse and human Tarzan sequences. The genome of the pufferfish {Takifugu rubripes) 
may also be screened to identify a Tarzan homologue; comparison of the 5' non-coding 
region of the Fugu Tarzan gene vdth a mouse or human genomic Tarzan sequence (e.g., 
SEQ ID NO: 10, mouse Tarzan genomic sequence) may be used to identify conserved 

20 regions containing control regions. 

Deletion studies may also be conducted to identify promoter and/or enhancer 
regions for Tarzan. 

The identity of putative control regions may be confirmed by molecular biology 
experiments, in which the candidate sequences are linked to a reporter gene and the 
25 expression of the reporter detected. 
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Calculation of Sequence Homology 

Sequence identity with respect to any of the sequences presented here can be 
determined by a simple "eyeball" comparison (i.e. a strict comparison) of any one or more 
of the sequences with another sequence to see if that other sequence has, for example, at 
5 least 70% sequence identity to the sequence(s). 

Relative sequence identity can also be determined by conunercially available 
computer programs that can calculate % identity between two or more sequences using 
any suitable algorithm for determining identity, using for example default parameters. A 
typical example of such a computer program is CLUSTAL. Other computer program 
10 methods to determine identify and similarity between the two sequences include but are 
not limited to the GCG program package (Devereux et al 1984 Nucleic Acids Research 
12: 387) and FASTA (Atschul et al 1990 J Molec Biol 403-410). 



% homology may be calculated over contiguous sequences, i.e. one sequence is 
aligned with the other sequence and each amino acid in one sequence is directly compared 
1 5 with the corresponding amino acid in the other sequence, one residue at a time. This is called 
an '"ungapped" aligrunent. Typically, such ungapped alignments are performed only over a 
relatively short number of residues. 

Although this is a very simple and consistent method, it fails to take into 
consideration that, for example, in an otherwise identical pair of sequences, one insertion or 

20 deletion will cause the following amino acid residues to be put out of alignment, thus 
potentially resulting in a large reduction in % homology when a global alignment is 
performed. Consequently, most sequence comparison methods are designed to produce 
optimal alignments that take into consideration possible insertions and deletions without 
penalising unduly the overall homology score. This is achieved by inserting "gaps" in the 

25 sequence alignment to try to maximise local homology. 

However, these more complex methods assign "gap penalties" to each gap that occurs 
in the alignment so that, for the same number of identical amino acids, a sequence alignment 
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with as few gaps as possible - reflecting higher relatedness between the two compared 
sequences - will achieve a higher score than one with many gaps. "AfFine gap costs" are 
typically used that charge a relatively high cost for the existence of a gap and a smaller 
penalty for each subsequent residue in the gap. This is the most commonly used gap scoring 
5 system. High gap penalties will of course produce optimised alignments with fewer gaps. 
Most alignment programs allow the gap penalties to be modified. However, it is preferred to 
use the defauh values when using such software for sequence comparisons. For example, 
when using the GCG Wisconsin Bestfit package the default gap penalty for amino acid 
sequences is -12 for a gap and -4 for each extension. 

1 0 Calculation of maximum % homology therefore firstly requires the production of an 

optimal alignment, taking into consideration gap penalties. A suitable computer program for 
carrying out such an alignment is the GCG Wisconsin Bestfit package (University of 
Wisconsin, U.S. A.; Devereux et aL, 1984, Nucleic Acids Research 12:387). Examples of 
other software than can perform sequence comparisons include, but are not limited to, the 

15 BLAST package (Ausubel et al, 1999 ibid- Chapter 1 8), FASTA (Atschul et aL, 1990, J. 
Mol. Biol., 403-410) and the GENEWORKS suite of comparison tools. Both BLAST and 
FASTA are available for offline and online searching (Ausubel et aL, 1999 ibid, pages 7- 
58 to 7-60). 

Although the final % homology can be measured in terms of identity, the 
20 alignment process itself is typically not based on an all-or-nothing pair comparison. 
Instead, a scaled similarity score matrix is generally used that assigns scores to each 
pairwise comparison based on chemical similarity or evolutionary distance. An example of 
such a matrix connunonly used is the BLOSUM62 matrix - the default matrix for the 
BLAST suite of programs. GCG Wisconsin programs generally use either the public 
25 default values or a custom symbol comparison table if supplied. It is preferred to use the 
public default values for the GCG package, or in the case of other software, the default 
matrix, such as BLOSUM62, 

Advantageously, the BLAST algorithm is employed, with parameters set to defauh 
values. The BLAST algorithm is described in detail at 
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http://vmw.ncbi.nih.gov/BLAST/blast_help.html, which is incorporated herem by 
reference. The search parameters are defined as follows, can be advantageously set to the 
defined default parameters. 

Advantageously, "substantial identity" when assessed by BLAST equates to 
5 sequences which match with an EXPECT value of at least about 7, preferably at least 
about 9 and most preferably 10 or more. The default threshold for EXPECT in BLAST 
searching is usually 10. 

BLAST (Basic Local Alignment Search Tool) is the heuristic search algorithm 
employed by the programs blastp, blastn, blastx, tblastn, and tblastx; these programs 
1 0 ascribe significance to their findings using the statistical methods of Karlin and Altschul 
(Karlin and Altschul 1990, Proc, Natl Acad Set USA 87:2264-68; Karlin and Altschul, 
1993, Proc. Natl Acad Set USA 90:5873-7; see 

http://www.ncbi.nih.gov/BLAST^last_help.html) with a few enhancements. The BLAST 
programs are tailored for sequence similarity searching, for example to identify 
1 5 homologues to a query sequence. For a discussion of basic issues in similarity searching of 
sequence databases, see Altschul et al (1994) Nature Genetics 6:1 19-129. 

The five BLAST programs available at http://www.ncbi.nlm.nih.gov perform the 
following tasks: blastp - compares an amino acid query sequence against a protein 
sequence database; blastn - compares a nucleotide query sequence against a nucleotide 

20 sequence database; blastx - compares the six-fiame conceptual translation products of a 
nucleotide query sequence (both strands) against a protein sequence database; tblastn - 
compares a protein query sequence against a nucleotide sequence database dynamically 
translated in all six reading frames (both strands); tblastx - compares the six-frame 
translations of a nucleotide query sequence against the six-fi^e translations of a 

25 nucleotide sequence database. 



BLAST uses the following search parameters: 
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HISTOGRAM - Display a histogram of scores for each search; default is yes. (See 
parameter H in the BLAST Manual). 

DESCRIPTIONS - Restricts the number of short descriptions of matching 
sequences reported to the number specified; default limit is 100 descriptions. (See 
5 parameter V in the manual page). 

EXPECT - The statistical significance threshold for reporting matches against 
database sequences; the default value is 10, such that 10 matches are expected to be found 
merely by chance, according to the stochastic model of Karlin and Altschul (1990). If the 
statistical significance ascribed to a match is greater than the EXPECT threshold, the 
1 0 match will not be reported. Lower EXPECT thresholds are more stringent, leading to 
fewer chance matches being reported. Fractional values are acceptable. (See parameter E 
in the BLAST Manual). 

CUTOFF - Cutoff score for reportmg high-scoring segment pairs. The defauU 
value is calculated from the EXPECT vaiue (see above). HSPs are reported for a database 
15 sequence only if the statistical significance ascribed to them is at least as high as would be 
ascribed to a lone HSP having a score equal to the CUTOFF value. Higher CUTOFF 
values are more stringent, leading to fewer chance matches being reported. (See parameter 
S in the BLAST Manual). Typically, significance thresholds can be more intuitively 
managed using EXPECT. 

20 ALIGNMENTS - Restricts database sequences to the number specified for which 

high-scoring segment pairs (HSPs) are reported; the default limit is SO. If more database 
sequences than this happen to satisfy the statistical significance threshold for reporting 
(see EXPECT and CUTOFF below), only the matches ascribed the greatest statistical 
significance are reported. (See parameter B in the BLAST Manual). 

25 MATRIX - Specify an alternate scoring matrix for BLASTP, BLASTX, 

TBLASTN and TBLASTX. The default matrix is BLOSUM62 (Henikoff & Henikoff, 
1992). The valid alternative choices include: PAM40, PAM120, PAM250 and 
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IDENTITY. No alternate scoring matrices are available for BLASTN; specifying the 
MATRIX directive in BLASTN requests returns an error response. 

STRAND - Restrict a TBLASTN search to just the top or bottom strand of the 
database sequences; or restrict a BLASTN, BLASTX or TBLASTX search to just reading 
5 frames on the top or bottom strand of the query sequence. 

FILTER - Mask off segments of the query sequence that have low compositional 
complexity, as determined by the SEG program of Wootton & Federhen (1993) 
Computers and Chemistry 17:149-163, or segments consisting of short-periodicity internal 
repeats, as determined by the XNU program of Claverie & States (1 993) Computers and 
10 Chemistry 1 7:191-201, or, for BLASTN, by the DUST program of Tatusov and Lipman 
(see http://www.ncbi.nlm.nih.gov). Filtering can eliminate statistically significant but 
biologically uninteresting reports from the blast output (e.g., hits against common acidic-, 
basic- or proline-rich regions), leaving the more biologically interesting regions of the 
query sequence available for specific matching against database sequences. 

15 Low complexity sequence found by a filter program is substituted using the letter 

"N" in nucleotide sequence (e.g., ' WNNNNNNNNNNN") and the letter "X" in protein 
sequences (e.g., "XXXXXXXXX"). 

Filtering is only applied to the query sequence (or its translation products), not to 
database sequences. Default filtering is DUST for BLASTN, SEG for other programs. 

20 It is not unusual for nothing at all to be masked by SEG, XNU, or both, when 

applied to sequences in SWISS-PROT, so filtering should not be expected to always yield 
an effect. Furthermore, in some cases, sequences are masked in their entirety, indicating 
that the statistical significance of any matches reported against the unfiltered query 
sequence should be suspect. 

25 NCBI-gi - Causes NCBI gi identifiers to be shown in the output, in addition to the 

accession and/or locus name. 
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Most preferably, sequence comparisons are conducted using the simple BLAST 
search algorithm provided at http://www.ncbi.nlm.nih.gov/BLAST. In some embodiments 
of the present invention, no gap penalties are used when determining sequence identity. 

Hybridisation 

5 The present invention also encompasses nucleotide sequences that are capable of 

hybridising to the sequences presented herein, or any fragment or derivative thereof, or to 
the complement of any of the above. 

Hybridization means a "process by which a strand of nucleic acid joins with a 
complementary strand through base pairing" (Coombs J (1994) Dictionary of 
1 0 Biotechnology, Stockton Press, New York NY) as well as the process of amplification as 
carried out in polymerase chain reaction technologies as described in Dieffenbach CW and 
GS Dveksler (1995, PGR Primer, a Laboratory Manual, Cold Spring Harbor Press, 
PlainviewNY). 

Hybridization conditions are based on the melting temperature (Tm) of the nucleic 
15 acid binding complex, as taught in Berger and Kimmel (1987, Guide to Molecular Cloning 
Techniques, Methods in Enzymology, Vol 152, Academic Press, San Diego CA), and 
confer a defined "stringency" as explained below. 

Nucleotide sequences of the invention capable of selectively hybridising to the 
nucleotide sequences presented herein, or to their complement, will be generally at least 70%, 

20 preferably at least 75%, more preferably at least 85 or 90% and even more preferably at least 
95% or 98% homologous to the correspondmg nucleotide sequences presented herein over a 
region of at least 20, preferably at least 25 or 30, for instance at least 40, 60 or 100 or more 
contiguous nucleotides. Preferred nucleotide sequences of die invention will comprise 
regions homologous to SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 7 or 

25 SEQ ID NO: 10, preferably at least 70%, 80% or 90% and more preferably at least 95% 
homologous to one of the sequences. 
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The tenn "selectively hybridizable" means that the nucleotide sequence used as a 
probe is used under conditions where a target nucleotide sequence of the invention is found to 
hybridize to the probe at a level significantly above background. The background 
hybridization may occur because of other nucleotide sequences present, for example, in the 
5 cDNA or genomic DNA library being screened. In this event, background implies a level of 
signal generated by interaction between the probe and a non-specific DNA member of the 
library which is less than 10 fold, preferably less than 100 fold as intense as the specific 
interaction observed with the target DNA. The intensity of interaction may be measured, for 
example, by radiolabelling tiie probe, e.g. with ^^P. 

1 0 Also included within the scope of the present invention are nucleotide sequences 

that are capable of hybridizing to the nucleotide sequences presented herein under 
conditions of intermediate to maximal stringency. Hybridization conditions are based on 
the melting temperature (Tm) of the nucleic acid binding complex, as taught in Berger and 
Kimmel (1987, Guide to Molecular Cloning Techniques, Methods in Enzymology, Vol 

15 1 52, Academic Press, San Diego CA), and confer a defined "stringency" as explained 
below. 

Maximum stringency typically occurs at about Tm-5^C (5*^C below the Tm of the 
probe); high stringency at about 5°C to \0°C below Tm; intermediate stringency at about 
lOX to 20^C below Tm; and low stringency at about 20*^0 to 25^C below Tm, As will be 
20 understood by those of skill in the art, a maximum stringency hybridization can be used to 
identify or detect identical nucleotide sequences while an intermediate (or low) stringency 
hybridization can be used to identify or detect similar or related nucleotide sequences. 

In a preferred embodiment, the present invention covers nucleotide sequences that 
can hybridise to one or more of the Tarzan GPCR nucleotide sequences of the present 
25 invention under stringent conditions (e.g. 65**C and O.lxSSC {IxSSC = 0.1 5 M NaCl, 0.015 
M Naa Citrate pH 7.0). Where the nucleotide sequence of the invention is double-stranded, 
both strands of the duplex, either individually or in combination, are encompassed by the 
present invention. Where the nucleotide sequence is single-stranded, it is to be understood 
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that the complementary sequence of that nucleotide sequence is also included within the 
scope of the present invention. 

The present invention also encompasses nucleotide sequences that are capable of 
hybridising to the sequences that are complementary to the sequences presented herein, or 
5 any fragment or derivative thereof Likewise, the present invention encompasses 

nucleotide sequences that are complementary to sequences that are capable of hybridising 
to the sequence of the present invention. These types of nucleotide sequences are 
examples of variant nucleotide sequences. In this respect, the term 'Variant" encompasses 
sequences that are complementary to sequences that are capable of hydridising to the 
10 nucleotide sequences presented herein. Preferably, however, the term "variant" 
encompasses sequences that are complementary to sequences that are capable of 
hydridising under stringent conditions (eg. 65°C and O.lxSSC {IxSSC = 0.15 MNaCl, 
0.015 Na3 citrate pH 7.0}) to the nucleotide sequences presented herein. 

Cloning of Tarzan GPCR and Homologues 

1 5 The present invention also encompasses nucleotide sequences that are 

complementary to the sequences presented here, or any fragment or derivative thereof If 
the sequence is complementary to a fragment thereof then that sequence can be used as a 
probe to identify and clone similar GPCR sequences in other organisms etc. 

The present invention thus enables the cloning of Tarzan GPCR, its homologues 
20 and other structurally or functionally related genes from human and other species such as 
mouse, pig, sheep, etc to be accomplished. Polynucleotides of the invention, which are 
identical or sufficiently identical to a nucleotide sequence contained m SEQ ID NO: 1, 
SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 7 or SEQ ID NO: 10 or a fragment thereof, 
may be used as hybridization probes for cDNA and genomic DNA, to isolate partial or 
25 full-length cDNAs and genomic clones encoding Tarzan GPCR from appropriate libraries. 
Such probes may also be used to isolate cDNA and genomic clones of other genes 
(including genes encoding homologues and orthologues from species other than human) 
that have sequence similarity, preferably high sequence similarity, to the Tarzan GPCR 
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gene. Hybridization screening, cloning and sequencing techniques are known to those of 
skill in the art and are described in, for example, Sambrook et al (supra). 

Typically nucleotide sequences suitable for use as probes are 70% identical, 
preferably 80% identical, more preferably 90% identical, even more preferably 95% 
5 identical to that of the referent. The probes generally will comprise at least 1 5 nucleotides. 
Preferably, such probes will have at least 30 nucleotides and may have at least 50 
nucleotides. Particularly preferred probes will range between 1 50 and 500 nucleotides, 
more particularly about 300 nucleotides. 

In one embodiment, to obtain a polynucleotide encoding a Tarzan GPCR 
10 polypeptide, including homologues and orthologues from species other than human, 
comprises the steps of screening an appropriate library under stringent hybridization 
conditions witii a labelled probe having the SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 
4, SEQ ID NO: 7 or SEQ ID NO: 10 or a fragment thereof and isolating partial or full- 
length cDNA and genomic clones containing said polynucleotide sequence. Such 
1 5 hybridization techniques are well known to those of skill in the art. Stringent hybridization 
conditions are as defined above or alternatively conditions under overnight incubation at 
42 degrees C. in a solution comprising: 50% formamide, 5XSSC (150 mM NaCl, 15mM 
trisodium citrate), 50 mM sodium phosphate (pH7.6), 5XDenhardt's solution, 10% 
dextran sulphate, and 20 microgram/ml denatured, sheared salmon sperm DNA, followed 
20 by washing the filters in 0. IXSSC at about 65 degrees C. 

Functional Assay for Tarzan GPCR 

The cloned putative Tarzan GPCR polynucleotides may be verified by sequence 
analysis or functional assays. For example, the putative Tarzan GPCR or homologue may 
be assayed for receptor activity as follows. Capped RNA transcripts fi-om linearized 
25 plasmid templates encoding the Tarzan receptor cDNAs of the invention are synthesized 
in vitro vsdth RNA polymerases in accordance with standard procedures. In vitro 
transcripts are suspended in water at a final concentration of 0.2 mg/ml. Ovarian lobes are 
removed firpm adult female toads. Stage V defolliculated oocytes are obtained, and RNA 
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transcripts (10 ng/oocyte) are injected in a SO nl bolus using a microinjection apparatus. 
Two electrode voltage clamps are used to measure the currents from individual Xenopus 
oocytes in response to agonist exposure. Recordings are made in Ca^"^ free Earth's 
medium at room temperature. The Xenopus system may also be used to screen known 
5 ligands and tissue/cell extracts for activating ligands, as described in further detail below. 

Expression Assays forTarzan GPCR 

In order to design useful therapeutics for treating Tarzan GPCR associated 
diseases, it is useful to determine the expression profile of Tarzan (whether wild-type or a 
particular mutant). Thus, methods known in the art may be used to determine the organs, 
1 0 tissues and cell types (as well as the developmental stages) in which Tarzan is expressed. 
For example, traditional or "electronic*' Northerns may be conducted. Reverse- 
transcriptase PGR (RT-PCR) may also be employed to assay expression of the Tarzan 
gene or mutant. More sensitive methods for determining the expression profile of Tarzan 
include RNAse protection assays, as known in the art. 

1 S Northern analysis is a laboratory technique used to detect the presence of a 

transcript of a gene and involves the hybridization of a labeled nucleotide sequence to a 
membrane on which RNAs from a particular cell type or tissue have been boimd. 
(Sambrook, supra, ch. 7 and Ausubel, F. M. et al. supra, ch. 4 and 16.) Analogous 
computer techniques ("electronic Northerns") applying BLAST may be used to search for 

20 identical or related molecules in nucleotide databases such as GenBank or the LIFESEQ 
database (Incyte Pharmaceuticals). This type of analysis has advantages in that they may 
be faster than multiple membrane^based hybridizations. In addition, the sensitivity of the 
computer search can be modified to determine whether any particular match is categorized 
as exact or homologous. 

25 The polynucleotides and polypeptides of the present invention, including the 

probes described above, may be employed as research reagents and materials for discovery 
of treatments and diagnostics to animal and human disease, as explained in fiirther detail 
elsewhere in this document. 
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Expression ofTarzan GPCR Polypeptides 

The invention includes a process for producing a Tarzan GPCR polypeptide. The 
method comprises in general culturing a host cell comprising a nucleic acid encoding 
Tarzan GPCR polypeptide, or a homologue, variant, or derivative thereof, under suitable 
5 conditions (i.e., conditions in which the Tarzan GPCR polypeptide is expressed). 

In order to express a biologically active Tarzan GPCR, the nucleotide sequences 
encoding Tarzan GPCR or homologues, variants, or derivatives thereof are inserted into 
appropriate expression vector, i.e., a vector which contains the necessary elements for the 
transcription and translation of the inserted coding sequence. 

10 Methods which are well known to those skilled in the art are used to construct 

expression vectors containing sequences encoding Tarzan GPCR and appropriate 
transcriptional and translational control elements. These methods include in vitro 
recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. 
Such techniques are described in Sambrook, J. et al. (1989; Molecular Cloning, A 

1 5 Laboratory Manual, ch, 4, 8, and 1 6- 1 7, Cold Spring Harbor Press, Plainview, N. Y.) and 
Ausubel, F. M. et al. (1995 and periodic supplements; Current Protocols in Molecular 
Biology, ch. 9, 13, and 16, John Wiley & Sons, New York, N.Y.). 

A variety of expression vector/host systems may be utilized to contain and express 
sequences encoding Tarzan GPCR. These include, but are not limited to, microorganisms 

20 such as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA 
expression vectors; yeast transformed with yeast expression vectors; insect cell systems 
infected with virus expression vectors (e.g., baculovirus); plant cell systems transformed 
with virus expression vectors (e.g., cauliflower mosaic virus (CaMV) or tobacco mosaic 
virus (TMV)) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 

25 animal cell systems. The invention is not limited by the host cell employed. 

The "control elements" or "regulatory sequences'* are those non-translated regions 
of the vector (i.e., enhancers, promoters, and 5* and 3' untranslated regions) which interact 
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with host cellular proteins to carry out transcription and translation. Such elements may 
vary in their strength and specificity. Depending on the vector system and host utilized, 
any number of suitable transcription and translation elements, including constitutive and 
inducible promoters, may be used. For example, when cloning in bacterial systems, 
5 inducible promoters such as the hybrid lacZ promoter of the BLUESCRIPT phagemid 
(Stratagene, La Jolla, Calif) or PSPORTl plasmid (GIBCO/BRL), and the like, may be 
used. The baculovirus polyhedrin promoter may be used in insect cells. Promoters or 
enhancers derived from the genomes of plant cells (e.g., heat shock, RUBISCO, and 
storage protein genes) or from plant viruses (e.g., viral promoters or leader sequences) 
10 may be cloned into the vector. In mammalian cell systems, promoters from mammalian 
genes or from mammalian viruses are preferable. If it is necessary to generate a cell line 
that contains multiple copies of the sequence encoding Tarzan GPCR, vectors based on 
SV40 or EBV may be used with an appropriate selectable marker. 

In bacterial systems, a number of expression vectors may be selected depending 
1 5 upon the use intended for Tarzan GPCR. For example, when large quantities of Tarzan 
GPCR are needed for the induction of antibodies, vectors which direct high level 
expression of fusion proteins that are readily purified may be used. Such vectors include, 
but are not limited to, multifunctional E. coli cloning and expression vectors such as 
BLUESCRIPT (Stratagene), in which the sequence encoding Tarzan GPCR may be 
20 ligated into the vector in frame with sequences for the amino-terminal Met and the 

subsequent 7 residues of p-galactosidase so that a hybrid protein is produced, pIN vectors 
(Van Heeke, G. and S. M. Schuster (1989) J. Biol. Chem. 264:5503-5509), and the like. 
pGEX vectors (Promega, Madison, Wis.) may also be used to express foreign polypeptides 
as frision proteins with glutathione S-transferase (GST). In general, such fusion proteins 
25 are soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose 
beads followed by elution in the presence of free glutathione. Proteins made in such 
systems may be designed to include heparin, thrombin, or factor XA protease cleavage 
sites so that the cloned polypeptide of interest can be released from the GST moiety at 
will. 
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In the yeast Saccharomyces cerevisiae, a number of vectors containing constitutive 
or inducible promoters, such as alpha factor, alcohol oxidase, and PGH, may be used. For 
reviews, see Ausubel (supra) and Grant et al. (1987; Methods Enzympl. 153:516-544). 

In cases where plant expression vectors are used, the expression of sequences 
5 encoding Tarzan GPCR may be driven by any of a number of promoters. For example, 
viral promoters such as the 3 5S and 19S promoters of CaMV may be used alone or in 
combination with the omega leader sequence from TMV. (Takamatsu, N. (1987) EMBO J. 
6:307-31 1 .) Alternatively, plant promoters such as the small subimit of RUBISCO or heat 
shock promoters may be used. (Coruzzi, G. et al (1984) EMBO J. 3:1671-1680; Broglie, 

10 R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell 
Differ. 17:85-105.) These constructs can be introduced into plant cells by direct DNA 
transformation or pathogen-mediated transfection. Such techniques are described in a 
number of generally available reviews. (See, for example, Hobbs, S. or Muny, L. E. in 
McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York, 

15 N.Y.; pp. 191-196.). 



An insect system may also be used to express Tarzan GPCR. For example, in one 
such system, Autographa californica nuclear polyhedrosis virus (AcNPV) is used as a 
vector to express foreign genes in Spodoptera frugiperda cells or in Trichoplusia larvae. 
The sequences encoding Tarzan GPCR may be cloned into a non-essential region of the 
20 virus, such as the polyhedrin gene, and placed under control of the polyhedrin promoter. 
Successful insertion of Tarzan GPCR will render the polyhedrin gene inactive and produce 
recombinant virus lacking coat protein. The recombinant viruses may then be used to 
infect, for example, SI frugiperda cells or Trichoplmia larvae in which Tarzan GPCR may 
be expressed. (Engelhard, E. K. et al. (1994) Proc. Nat. Acad. Sci. 91 :3224-3227.) 

25 In manmialian host cells, a number of viral-based expression systems may be 

utilized. In cases where an adenovirus is used as an expression vector, sequences encoding 
Tarzan GPCR may be ligated into an adenovirus transcription/translation complex 
consisting of the late promoter and tripartite leader sequence. Insertion in a non-essential 
El or E3 region of the viral genome may be used to obtain a viable virus which is capable 
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of expressing Tarzan GPCR in infected host cells. (Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. 81 :3655-3659.) In addition, transcription enhancers, such as the Rous 
sarcoma virus (RS V) enhancer, may be used to increase expression in mammalian host 
cells. 

5 Thus, for example, the Tarzan receptors of the present invention are expressed in 

either human embryonic kidney 293 (HEK293) cells or adherent dhfr CHO cells. To 
maximize receptor expression, typically all 5* and 3' untranslated regions (UTRs) are 
removed from the receptor cDNA prior to insertion into a pCDN or pCDNA3 vector. The 
cells are transfected with individual receptor cDNAs by lipofectin and selected in the 
10 presence of 400 mg/ml G418. After 3 weeks of selection, individual clones are picked and 
expanded for further analysis. HEK293 or CHO cells transfected with the vector alone 
serve as negative controls. To isolate cell lines stably expressing the individual receptors, 
about 24 clones are typically selected and analyzed by Northern blot analysis. Receptor 
mRNAs are generally detectable in about 50% of the G418-resistant clones analyzed. 

15 Human artificial chromosomes (HACs) may also be employed to deliver larger 

fragments of DNA than can be contained and expressed in a plasmid. HACs of about 6 kb 
to 10 Mb are constructed and delivered via conventional delivery methods (liposomes, 
polycationic amino polymers, or vesicles) for therapeutic purposes. 

Specific initiation signals may also be used to achieve more efficient translation of 
20 sequences encoding Tarzan GPCR. Such signals include the ATG initiation codon and 
adjacent sequences. In cases where sequences encoding Tarzan GPCR and its initiation 
codon and upstream sequences are inserted into the appropriate expression vector, no 
additional transcriptional or translational control signals may be needed. However, in 
cases where only coding sequence, or a fragment thereof, is inserted, exogenous * 
25 translational control signals including the ATG initiation codon should be provided. 
Furthermore, the initiation codon should be in the correct reading firame to ensure 
translation of the entire insert. Exogenous translational elements and initiation codons may 
be of various origins, both natural and synthetic. The efficiency of expression may be 
enhanced by the inclusion of enhancers appropriate for the particular cell system used, 
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such as those described in the literature. (Scharf, D. et al. (1994) Results Probl. Cell 
Differ. 20:125-162.) 

In addition, a host cell strain may be chosen for its ability to modulate expression 
of the inserted sequences or to process the expressed protein in the desired fashion. Such 
5 modifications of the polypeptide include, but are not limited to, acetylation, carboxylation, 
glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing 
which cleaves a "prepro" form of the protein may also be used to facilitate correct 
insertion, folding, and/or function. Different host cells which have specific cellular 
machinery and characteristic mechanisms for post-translational activities (e.g., CHO, 
1 0 HeLa, MDCK, HEK293, and WI38), are available from the American Type Culture 

Collection (ATCC, Bethesda, Md.) and may be chosen to ensure the correct modification 
and processing of the foreign protein. 

For long term, high yield production of recombinant proteins, stable expression is 
preferred. For example, cell lines capable of stably expressing Tarzan GPCR can be 

1 5 transformed using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a 
separate vector. Following the introduction of the vector, cells may be allowed to grow for 
about 1 to 2 days in enriched media before being switched to selective media. The purpose 
of the selectable marker is to confer resistance to selection, and its presence allows growth 

20 and recovery of cells which successfully express the introduced sequences. Resistant 
clones of stably transfomied cells may be proliferated using tissue culture techniques 
appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. 
These include, but are not limited to, the herpes simplex virus thymidine kinase genes 
25 (Wigler, M. et al. (1 977) Cell 1 1 :223-32) and adenine phosphoribosyltransferase genes 
(Lowy, 1. et al. (1980) Cell 22:817-23), which can be employed in tk' or apf cells, 
respectively. Also, antimetabolite, antibiotic, or herbicide resistance can be used as the 
basis for selection. For example, dhfir confers resistance to methotrexate (Wigler, M. et al. 
(1980) Proc. Natl. Acad. Sci. 77:3567-70); npt confers resistance to the aminoglycosides 
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neomycin and G-418 (Colbere-Garapin, F. et al (1981) J. Mol. Biol. 150:1-14); and als or 
pat confer resistance to chlorsulfiiron and phosphinotricin acetyltransferase, respectively 
(Murry, supra). Additional selectable genes have been described, for example, trpB, which 
allowrs cells to utilize indole in place of tryptophan, or hisD, which allows cells to utilize 
5 histinol in place of histidine. (Hartman, S. C. and R. C. Mulligan (1988) Proc. Natl. Acad. 
Sci. 85:8047-51.) Recently, the use of visible markers has gained popularity with such 
markers as anthocyanins, P-glucuronidase and its substrate GUS, and luciferase and its 
substrate luciferin. These markers can be used not only to identify transformants, but also 
to quantify the amount of transient or stable protein expression attributable to a specific 
10 vector system. (Rhodes, C. A. et al. (1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of 
interest is also present, the presence and expression of the gene may need to be confirmed. 
For example, if the sequence encoding Tarzan GPCR is inserted within a marker gene 
sequence, transformed cells containing sequences encoding Tarzan GPCR can be 
1 5 identified by the absence of marker gene function. Alternatively, a marker gene can be 
placed in tandem with a sequence encoding Tarzan GPCR under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually 
indicates expression of the tandem gene as well. 

Alternatively, host cells which contain the nucleic acid sequence encoding Tarzan 
20 GPCR and express Tarzan GPCR may be identified by a variety of procedures known to 
those of skill in the art. These procedures include, but are not limited to, DNA-DNA or 
DNA-RNA hybridizations and protein bioassay or immunoassay techniques which include 
membrane, solution, or chip based technologies for the detection and/or quantification of 
nucleic acid or protein sequences. 

25 The presence of polynucleotide sequences encoding Tarzan GPCR can be detected 

by DNA-DNA or DNA-RNA hybridization or amplification using probes or fragments or 
fragments of polynucleotides encoding Tarzan GPCR. Nucleic acid amplification based 
assays involve the use of oligonucleotides or oligomers based on the sequences encoding 
Tarzan GPCR to detect transfonnants containing DNA or RNA encoding Tarzan GPCR. 
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A variety of protocols for detecting and measuring the expression of Tarzan 
GPCR, using either polyclonal or monoclonal antibodies specific for the protein, are 
known in the art. Examples of such techniques include enzyme-linked immimosorbent 
assays (ELISAs), radioimmunoassays (RIAs), and fluorescence activated cell sorting 
5 (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering epitopes on Tarzan GPCR is preferred, but a competitive 
binding assay may be employed. These and other assays are well described in the art, for 
example, in Hampton, R. et al. (1990; Serological Methods, a Laboratory Manual, Section 
IV, APS Press, St Paul, Minn.) and in Maddox, D. E. et al. (1983; J. Exp. Med. 158:121 1- 
10 1216). 

A wide variety of labels and conjugation techniques are known by those skilled in 
the art and may be used in various nucleic acid and amino acid assays. Means for 
producing labeled hybridization or PGR probes for detecting sequences related to 
polynucleotides encoding Tarzan GPCR include oligolabeling, nick translation, end* 

1 S labeling, or PCR amplification using a labeled nucleotide. Alternatively, the sequences 
encoding Tarzan GPCR, or any fi-agments thereof, may be cloned into a vector for the 
production of an mRNA probe. Such vectors are known in the art, are commercially 
available, and may be used to synthesize RNA probes in vitro by addition of an 
appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These 

20 procedures may be conducted using a variety of conunercially available kits, such as those 
provided by Pharmacia & Upjohn (Kalamazoo, Mich.), Promega (Madison, Wis.), and 
U.S. Biochemical Corp. (Cleveland, Ohio). Suitable reporter molecules or labels which 
may be used for ease of detection include radionuclides, enzymes, fluorescent, 
chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, 

25 magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding Tarzan GPCR may be 
cultured under conditions suitable for the expression and recovery of the protein fi^om cell 
culture. The protein produced by a transformed cell may be located in the cell membrane, 
secreted or contained intracellularly depending on the sequence and/or the vector used. As 
30 will be understood by those of skill in the art, expression vectors containing 
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polynucleotides which encode Tarzan GPCR may be designed to contain signal sequences 
which direct secretion of Tarzan GPCR through a prokaryotic or eukaryotic cell 
membrane. Other constructions may be used to join sequences encoding Tarzan GPCR to 
nucleotide sequences encoding a polypeptide domain which will facilitate purification of 
5 soluble proteins. Such purification facilitating domains include, but are not limited to, 
metal chelating peptides such as histidine-tryptophan modules that allow purification on 
immobilized metals, protein A domains that allow purification on immobilized 
immunoglobulin, and the domain utilized in the FLAGS extension/affinity purification 
system (Immunex Corp., Seattle, Wash.). The inclusion of cleavable linker sequences, 

1 0 such as those specific for Factor XA or enterokinase (Invitrogen, San Diego, Calif.), 

between the purification domain and the Tarzan GPCR encoding sequence may be used to 
facilitate purification. One such expression vector provides for expression of a fusion 
protein containing Tarzan GPCR and a nucleic acid encoding 6 histidine residues 
preceding a thioredoxin or an enterokinase cleavage site. The histidine residues facilitate 

1 S purification on immobilized metal ion afGnity chromatography (IMIAC; described in 
Porath, J. et al. (1992) Prot. Exp. Purif. 3: 263-281), while the enterokinase cleavage site 
provides a means for purifying Tarzan GPCR firom the fusion protein. A discussion of 
vectors which contain fusion proteins is provided in Kroll, D. J. et al. (1993; DNA Cell 
Biol. 12:441-453). 

20 Fragments of Tarzan GPCR may be produced not only by recombinant production, 

but also by direct peptide synthesis xising solid-phase techniques, (Merrifield J. (1963) J. 
Am, Chem. Soc. 85:2149-2154.) Protein synthesis may be performed by manual 
techniques or by automation. Automated synthesis may be achieved, for example, using 
the Applied Biosystems 431 A peptide synthesizer (Perkin Elmer). Various fragments of 

25 Tarzan GPCR may be synthesized separately and then combined to produce the full length 
molecule. 

Biosensors 

The Tarzan polypeptides, nucleic acids, probes, antibodies, expression vectors and 
ligands are useful as (and for the production of) biosensors. 
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According to Aizawa (1988), AnaL Chem, Symp, 17: 683, a biosensor is defined as 
being a unique combination of a receptor for molecular recognition, for example a 
selective layer with immobilized antibodies or receptors such as a Tarzan G-protein 
coupled receptor, and a transducer for transmitting the values measured. One group of 
5 such biosensors will detect the change which is caused in the optical properties of a 
surface layer due to the interaction of the receptor with the surrounding medium. Among 
such techniques may be mentioned especially ellipso-metry and surface plasmon 
resonance. Biosensors incorporating Tarzan may be used to detect the presence or level of 
Tarzan ligands, for example, nucleotides such as purines or purine analogues, or analogues 
10 of these ligands. The construction of such biosensors is well known in the art. 

Thus, cell lines expressing Tarzan receptor may be used as reporter systems for 
detection of ligands such as ATP via receptor-promoted formation of [3H]inositol 
phosphates or other second messengers (Watt et al., 1998, J Biol Chem May 
29;273(22): 14053-8). Receptor-ligand biosensors are also described in Hoffman et al., 
15 2000, Proc Natl Acad Sci USA Oct 10;97(21):1 1215-20. Optical and other biosensors 
comprising Tarzan may also be used to detect the level or presence of interaction with G- 
proteins and other proteins, as described by, for example, Figler et al, 1997, Biochemistry 
Dec 23;36(51):16288-99 and Sarrio et al., 2000, Mol Cell Biol 2000 Jul;20(14):51 64-74). 
Sensor units for biosensors are described in, for example, US 5,492,840. 

20 Screening Assays 

The Tarzan GPCR polypeptide of the present invention, including homologues, 
variants, and derivatives, whether natural or recombinant, may be employed in a screening 
process for compounds which bind the receptor and which activate (agonists) or inhibit 
activation of (antagonists) of Tarzan. Thus, polypeptides of the invention may also be used 
25 to assess the binding of small molecule substrates and ligands in, for example, cells, cell- 
free preparations, chemical libraries, and natural product mixtures. These substrates and 
ligands may be natural substrates and ligands or may be structural or functional mimetics. 
See Coligan et al.. Current Protocols in Immunology l(2):Chapter 5 (1991). 
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Tarzan GPCR polypeptides are responsible for many biological functions, 
including many pathologies. Accordingly, it is desirous to find compounds and drugs 
which stimulate Tarzan GPCR on the one hand and which can inhibit the function of 
Tarzan GPCR on the other hand. In general, agonists and antagonists are employed for 
5 therapeutic and prophylactic purposes for such conditions as infections such as bacterial, 
fungal, protozoan and viral infections, particularly infections caused by HIV-1 or HIV-2; 
pain; cancers; diabetes, obesity; anorexia; bulimia; asthma; parkinson's disease; 
thrombosis; acute heart failure; hypotension; hypertension; erectile dysfunction; urinary 
retention; metabolic bone diseases such as osteoporisis and osteo petrosis; angina pectoris; 
10 myocardial mfarction; ulcers; asthma; allergies; rheumatoid arthritis; inflammatory bowel 
disease; irritable bowel syndrome benign prostatic hypertrophy; and psychotic and 
neurological disorders, including anxiety, schizophrenia, manic depression, delirium, 
dementia, severe mental retardation and dyskinesias, such as Huntington's disease or 
Gilles delaTourett's syndrome. 

1 5 Rational design of candidate compoimds likely to be able to interact with Tarzan 

GPCR protein may be based upon structural studies of the molecular shapes of a 
polypeptide according to the invention. One means for determining which sites interact 
with specific other proteins is a physical structure determination, e.g., X-ray 
crystallography or two-dimensional NMR techniques. These will provide guidance as to 

20 which amino acid residues form molecular contact regions. For a detailed description of 
protein structural determination, see, e.g., Blundell and Johnson (1976) Protein 
Crystallography, Academic Press, New York. 

An alternative to rational design uses a screening procedure which involves in 
general producing appropriate cells which express the Tarzan receptor polypeptide of the 
25 present invention on the surface thereof Such cells include cells fi^m animals, yeast, 
Drosophila or £. coll Cells expressing the receptor (or cell membrane containing the 
expressed receptor) are then contacted with a test compound to observe binding, or 
stimulation or inhibition of a functional response. For example, Xenopus oocytes may be 
injected with Tarzan mRNA or polypeptide, and currents induced by exposure to test 
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compounds measured by use of voltage clamps measured, as described in further detail 
elsewhere. 

Furthermore, microphysiometric assays may be employed to assay Tarzan receptor 
activity. Activation of a wide variety of secondary messenger systems results in extrusion 
5 of small amoimts of acid from a cell. The acid formed is largely as a result of the increased 
metabolic activity required to fuel the intracellular signalling process. The pH changes in 
the media suiroxmding the cell are very small but are detectable by, for example, the 
CYTOSENSOR microphysiometer (Molecular Devices Ltd., Menlo Park, Calif.). The 
C YTOSENSOR is thus capable of detecting the activation of a receptor which is coupled 
10 to an energy utilizing intracellular signaling pathway such as the G-protein coupled 
receptor of the present invention. 

Instead of testing each candidate compound individually with the Tarzan receptor, 
a library or bank of candidate ligands may advantageously be produced and screened. 
Thus, for example, a bank of over 200 putative receptor ligands has been assembled for 

1 5 screening. The bank comprises: transmitters, hormones and chemokines known to act via a 
human seven transmembrane (7TM) receptor; naturally occurring compounds which may 
be putative agonists for a human 7TM receptor, non-mammalian, biologically active 
peptides for which a mammalian counterpart has not yet been identified; and compounds 
not found in nature, but which activate 7TM receptors with imknown natural ligands. This 

20 bank is used to screen the receptor for known ligands, using both functional (i.e. calcium, 
cAMP, microphysiometer, oocyte electrophysiology, etc, see elsewhere) as well as 
binding assays as described in further detail elsewhere. However, a large number of 
mammalian receptors exist for which there remains, as yet, no cognate activating ligand 
(agonist) or deactivating Ugand (antagonist). Thus, active Ugands for these receptors may 

25 not be included within the ligands banks as identified to date. Accordingly, theTarzan 
recq)tor of the mvention is also functionally screened (using calcium, cAMP, 
microphysiometer, oocyte electrophysiology, etc., functional screens) against tissue 
extracts to identify natural ligands. Extracts that produce positive functional responses can 
be sequentially subfiractionated, with the fractions being assayed as described here, until 

30 an activating ligand is isolated and identified. 
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7TM receptors which are expressed in HEK 293 cells have been shown to be 
coupled fiinctionally to activation of PLC and calcium mobilization and/or cAMP 
stimuation or inhibition. One screening technique therefore includes the use of cells which 
express the Tarzan GPCR receptor of this invention (for example, transfected Xenopus 
5 oocytes, CHO or HEK293 cells) in a system which measures extracellular pH or 

intracellular calcium changes caused by receptor activation. In this technique, compounds 
may be contacted with cells expressing the receptor polypeptide of the present invention. 
A second messenger response, e.g., signal transduction, pH changes, or changes in 
calcium level, is then measured to determine whether the potential compound activates or 
1 0 inhibits the receptor. 

In such experiments, basal calcium levels in the HEK 293 cells in receptor- 
transfected or vector control cells are observed to be in the normal, 1 00 nM to 200 nM, 
range. HEK 293 cells expressing Tarzan GPCR or recombinant Tarzan GPCR are loaded 
with fiira 2 and in a single day more than 150 selected ligands or tissue/cell extracts are 
1 5 evaluated for agonist induced calciimi mobilization. Similarly, HEK 293 cells expressing 
Tarzan GPCR or recombinant Tarzan GPCR are evaluated for the stimulation or inhibition 
of cAMP production using standard cAMP quantitation assays. Agonists presenting a 
calcium transient or cAMP fluctuation are tested in vector control cells to determine if the 
response is unique to the transfected cells expressing receptor. 

20 Another method involves screening for receptor inhibitors by determining 

inhibition or stimulation of Tarzan receptor-mediated cAMP and/or adenylate cyclase 
accumulation. Such a method involves transfecting a eukaryotic cell with the receptor of 
this invention to express the receptor on the cell surface. The cell is then exposed to 
potential antagonists in the presence of the receptor of this invention. The amount of 

25 cAMP accumulation is then measured. If the potential antagonist binds the receptor, and 
thus inhibits receptor binding, the levels of receptor-mediated cAMP, or adenylate cyclase, 
activity will be reduced or increased. 
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Another method for detecting agonists or antagonists for the receptor of the present 
invention is the yeast based technology as described in U.S. Pat. No. 5,482,835, 
incorporated by reference herein. 

Where the candidate compounds are proteins, in particular antibodies or peptides, 
5 libraries of candidate compounds may be screened using phage display techniques. Phage 
display is a protocol of molecular screening which utilises recombinant bacteriophage. 
The technology involves transforming bacteriophage with a gene that encodes one 
compound from the library of candidate compounds, such that each phage or phagemid 
expresses a particular candidate compound. The transformed bacteriophage (which 

1 0 preferably is tethered to a solid support) expresses the appropriate candidate compound 
and displays it on their phage coat. Specific candidate compounds which are capable of 
binding to a polypeptide or peptide of the invention are enriched by selection strategies 
based on affinity interaction. The successful candidate agents are then characterised. 
Phage display has advantages over standard affinity ligand screening technologies. The 

15 phage surface displays the candidate agent in a three dimensional configuration, more 

closely resembling its naturally occurring conformation. This allows for more specific and 
higher affinity binding for screening purposes. 

Another method of screening a library of compounds utilises eukaryotic or 
prokaryotic host cells which are stably transformed with recombinant DNA molecules 

20 expressing a library of compoimds. Such cells, either in viable or fixed form, can be used 
for standard binding-partner assays. See also Parce et al (1989) Science 246:243-247; and 
Owicki et al (1990) Proc. Nat'l Acad. Sci. USA 87;4007-401 1, which describe sensitive 
methods to detect cellular responses. Competitive assays are particularly useful, where the 
cells expressing the library of compounds are contacted or incubated with a labelled 

25 antibody known to bind to a Tarzan polypeptide of the present invention, such as '^^I- 
antibody, and a test sample such as a candidate compound whose binding affinity to the 
binding composition is being measured. The bound and firee labelled binding partners for 
the polypeptide are then separated to assess the degree of bindmg. The amount of test 
sample bound is inversely proportional to the amount of labelled antibody bmding to the 

30 polypeptide. 
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Any one of numerous techniques can be used to separate bound from free binding 
partners to assess the degree of binding. This separation step could typically involve a 
procedure such as adhesion to filters followed by washing, adhesion to plastic following 
by washing, or centrifiigation of the cell membranes. 

5 Still another approach is to use solubilized, unpurified or solubilized purified 

polypeptide or peptides, for example extracted from transformed eukaryotic or prokaryotic 
host cells. This allows for a "molecular" binding assay with the advantages of increased 
specificity, the ability to automate, and high drug test throughput. 

Another technique for candidate compound screening involves an approach which 
10 provides high throughput screening for new compounds having suitable binding affinity, 
e.g., to a polypeptide of the invention, and is described in detail in International Patent 
application no. WO 84/03564 (Commonwealth Serum Labs.), published on September 13 
1 984. First, large numbers of different small peptide test compounds are synthesized on a 
solid substrate, e.g., plastic pins or some other appropriate surface; see Fodor et al. (1991). 
1 5 Then all the pins are reacted with solubilized polypeptide of the invention and washed. 
The next step involves detecting bound polypeptide. Compounds which interact 
specifically Vidth the polypeptide will thus be identified. 

Ligand binding assays provide a direct method for ascertaining receptor 
pharmacology and are adaptable to a high throughput format. The purified ligand for a 

20 receptor may be radiolabeled to high specific activity (50-2000 Ci/mmol) for binding 
studies. A determination is then made that the process of radiolabeling does not diminish 
the activity of the ligand towards its receptor. Assay conditions for buffers, ions, pH and 
other modulators such as nucleotides are optimized to establish a workable signal to noise 
ratio for both membrane and whole cell receptor sources. For these assays, specific 

25 receptor binding is defined as total associated radioactivity minus the radioactivity 
measured in the presence of an excess of unlabeled competing ligand. Where possible, 
more than one competing ligand is used to define residual nonspecific bmding. 
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The assays may simply test binding of a candidate compound wherein adherence to 
the cells bearing the receptor is detected by means of a label directly or indirectly 
associated with the candidate compound or in an assay involving competition with a 
labeled competitor. Further, these assays may test whether the candidate compoimd results 
5 in a signal generated by activation of the receptor, using detection systems appropriate to 
the cells bearing the receptor at their surfaces. Inhibitors of activation are generally 
assayed in the presence of a known agonist and the effect on activation by the agonist by 
the presence of the candidate compound is observed. 

Further, the assays may simply comprise the steps of mixing a candidate 
10 compound with a solution containing a Tarzan GPCR polypeptide to form a mixture, 

measuring Tarzan GPCR activity in the mixture, and comparing the Tarzan GPCR activity 
of the mixture to a standard. 

The Tarzan GPCR cDNA, protein and antibodies to the protein may also be used to 
configure assays for detecting the effect of added compounds on the production of Tarzan 

1 5 GPCR mRN A and protein in cells. For example, an ELIS A may be constructed for 
measuring secreted or cell associated levels of Taizan GPCR protein using monoclonal 
and polyclonal antibodies by standard methods known in the art, and this can be used to 
discover agents which may inhibit or enhance the production of Tarzan GPCR (also called 
antagonist or agonist, respectively) from suitably manipulated cells or tissues. Standard 

20 methods for conducting screening assays are well understood in the art. 

Examples of potential Tarzan GPCR antagonists include antibodies or, in some 
cases, nucleotides and their analogues, including purines and purine analogues, 
oligonucleotides or proteins which are closely related to the ligand of the Tarzan GPCR, 
e.g., a fragment of the ligand, or small molecules which bind to the receptor but do not 
25 elicit a response, so that the activity of the receptor is prevented. 

The present invention therefore also provides a compound capable of binding 
specifically to a Tarzan polypeptide and/br peptide of the present invention. 
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The term "compound" refers to a chemical compoimd (naturally occurring or 
synthesised), such as a biological macromolecule (e.g., nucleic acid, protein, non-peptide, 
or organic molecule), or an extract made from biological materials such as bacteria, plants, 
fungi, or animal (particularly mammalian) cells or tissues, or even an inorganic element or 
5 molecule. Preferably the compound is an antibody. 

The materials necessary for such screening to be conducted may be packaged into 
a screening kit. Such a screening kit is useful for identifying agonists, antagonists, ligands, 
receptors, substrates, enzymes, etc. for Tarzan GPCR polypeptides or compounds which 
decrease or enhance the production of Tarzan GPCR polypeptides. The screening kit 
10 comprises: (a) a Tarzan GPCR polypeptide; (b) a recombinant cell expressing a Tarzan 
GPCR polypeptide; (c) a cell membrane expressing a Tarzan GPCR polypeptide; or (d) 
antibody to a Tarzan GPCR polypeptide. The screening kit may optionally comprise 
instructions for use. 

Transgenic Animals 

15 The present invention further encompasses transgenic animals capable of 

expressing natural or recombinant Tarzan GPCR, or a homologue, variant or derivative, at 
elevated or reduced levels compared to the normal expression level. Included are 
transgenic animals ("Tarzan knockout"s) which do not express functional Tarzan receptor. 
The Tarzan knockouts may arise as a result of functional disruption of the Tarzan gene or 

20 any portion of that gene, including one or more loss of function mutations, including a 

deletion or replacement, of the Tarzan gene. The mutations include single point mutations, 
and may target coding or non-coding regions of Tarzan. 

Preferably, such a transgenic animal is a non-human mammal, such as a pig, a 
sheep or a rodent. Most preferably the transgenic animal is a mouse or a rat Such 
25 transgenic animals may be used in screening procedures to identify agonists and/or 

antagonists of Tarzan GPCR, as well as to test for their efficacy as treatments for diseases 
in vivo. 
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For example, transgenic animals that have been engineered to be deficient in the 
production of Tarzan GPCR may be used in assays to identify agonists and/or antagonists 
of Tarzan GPCR. One assay is designed to evaluate a potential drug (aa candidate ligand 
or compound) to determine if it produces a physiological response in the absence of 
5 Tarzan GPCR receptors. This may be accomplished by administering the drug to a 
transgenic animal as discussed above, and then assaying the animal for a particular 
response. Any physiological parameter could be measured in this assay. 

Tissues derived from the Tarzan knockout animals may be used in receptor binding 
assays to determine whether the potential drug (a candidate ligand or compound) binds to 

1 0 the tarzan receptor. Such assays can be conducted by obtaining a first receptor 

preparation from the transgenic animal engineered to be deficient in Tarzan receptor 
production and a second receptor preparation from a source known to bind any identified 
Tarzan ligands or compounds. In general, the first and second receptor preparations will be 
similar in all respects except for the source from which they are obtamed. For example, if 

1 5 brain tissue from a transgenic animal (such as described above and below) is used in an 
assay, comparable brain tissue from a normal (wild type) animal is used as the source of 
the second receptor preparation. Each of the receptor preparations is incubated with a 
ligand known to bind to Tarzan receptors, both alone and in the presence of the candidate 
ligand or compound. Preferably, the candidate ligand or compoimd will be examined at 

20 several different concentrations. 

The extent to which binding by the known ligand is displaced by the test 
compoimd is determined for both the first and second receptor preparations. Tissues 
derived from transgenic animals may be used in assays directly or the tissues may be 
processed to isolate membranes or membrane proteins, which are themselves used in the 
25 assays. A preferred transgenic animal is the mouse. The ligand may be labeled using any 
means compatible with binding assays. This would include, without lunitation, 
radioactive, enzymatic, fluorescent or chemiluminescent labeling (as well as other 
labelling techniques as described in further detail above). 
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Furthermore, antagonists of Tarzan GPCR receptor may be identified by 
administering candidate compounds, etc, to wild type animals expressing functional 
Tarzan, and animals identified which exhibit any of the phenotypic characteristics 
associated with reduced or abolished expression of Tarzan receptor fimction. 

5 Detailed methods for generating non-human transgenic animal are described in 

further detail below. Transgenic gene constructs can be introduced into the germ line of an 
animal to make a transgenic mammal. For example, one or several copies of the construct 
may be incorporated into the genome of a mammalian embryo by standard transgenic 
techniques. 

10 In an exemplary embodiment, the transgenic non-human animals of the invention 

are produced by introducing transgenes into the germline of the non-human animal. 
Embryonal target cells at various developmental stages can be used to introduce 
transgenes. Different methods are used depending on the stage of development of the 
embryonal target cell. The specific line(s) of any animal used to practice this invention are 

1 5 selected for general good health, good embryo yields, good pronuclear visibility in the 
embryo, and good reproductive fitness. In addition, the haplotype is a significant factor. 

Introduction of the transgene into the embryo can be accomplished by any means 
known in the art such as, for example, microinjection, electroporation, or lipofection. For 
example, the Tarzan receptor transgene can be introduced into a mammal by 

20 microinjection of the construct into the pronuclei of the fertilized mammalian egg(s) to 
cause one or more copies of the construct to be retained in the cells of the developing 
manmial(s). Following introduction of the transgene construct into the fertilized egg, the 
egg may be incubated in vitro for varying amounts of time, or reimplanted into the 
surrogate host, or both. In vitro incubation to maturity is within the scope of this invention. 

25 One common method in to incubate the embryos in vitro for about 1-7 days, depending on 
the species, and then reimplant them into the surrogate host. 

The progeny of the transgenically manipulated embryos can be tested for the 
presence of the construct by Southern blot analysis of the segment of tissue. If one or more 
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copies of the exogenous cloned construct remains stably integrated into the genome of 
such transgenic embryos, it is possible to establish permanent transgenic mammal lines 
carrying the transgenically added construct. 

The litters of transgenically altered mammals can be assayed after birth for the 
5 incorporation of the construct mto the genome of the offspring. Preferably, this assay is 
accomplished by hybridizing a probe corresponding to the DNA sequence coding for the 
desired recombinant protein product or a segment thereof onto chromosomal material from 
the progeny. Those mammalian progeny found to contain at least one copy of the construct 
in their genome are grown to maturity. 

For the purposes of this invention a zygote is essentially the formation of a diploid 
cell which is capable of developing into a complete organism. Generally, the zygote vnl\ 
be comprised of an egg containing a nucleus formed, either naturally or artificially, by the 
fusion of two haploid nuclei from a gamete or gametes. Thus, the gamete nuclei must be 
ones which are naturally compatible, i.e., ones which result in a viable zygote capable of 
undergoing differentiation and developing into a functioning organism. Generally, a 
euploid zygote is preferred. If an aneuploid zygote is obtained, then the number of 
chromosomes should not vary by more than one with respect to the euploid number of the 
organism from which either gamete originated. 

In addition to similar biological considerations, physical ones also govern the 
20 amount (e.g., volimie) of exogenous genetic material which can be added to the nucleus of 
the zygote or to the genetic material which forms a part of the zygote nucleus. If no 
genetic material is removed, then the amount of exogenous genetic material which can be 
added is limited by the amount which will be absorbed without being physically 
disruptive. Generally, the volume of exogenous genetic material inserted will not exceed 
25 about 10 picoliters. The physical effects of addition must not be so great as to physically 
destroy the viability of the zygote. The biological limit of the number and variety of DNA 
sequences will vary depending upon the particular zygote and functions of the exogenous 
genetic material and wall be readily apparent to one skilled in the art, because the genetic 
material, including the exogenous genetic material, of the resulting zygote must be 
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biologically capable of initiating and maintaining the differentiation and development of 
the zygote into a fimctional organism. 

The number of copies of the transgene constructs which are added to the zygote is 
dependent upon the total amount of exogenous genetic material added and will be the 
5 amount which enables the genetic transformation to occur. Theoretically only one copy is 
required; however, generally, numerous copies are utilized, for example, 1,000-20,000 
copies of the transgene construct, in order to insure that one copy is functional. As regards 
the present invention, there will often be an advantage to having more than one 
functioning copy of each of the inserted exogenous DNA sequences to enhance the 
1 0 phenotypic expression of the exogenous DNA sequences. 

Any technique which allows for the addition of the exogenous genetic material into 
nucleic genetic material can be utilized so long as it is not destructive to the cell, nuclear 
membrane or other existing cellular or genetic structures. The exogenous genetic material 
is preferentially inserted into the nucleic genetic material by microinjection. 
1 5 Microinjection of cells and cellular structures is known and is used in the art. 

Reimplantation is accomplished using standard methods. Usually, the surrogate 
host is anesthetized, and the embryos are inserted into the oviduct. The number of 
embryos implanted into a particular host will vary by species, but will usually be 
comparable to the number of off spring the species naturally produces. 

20 Transgenic offspring of the surrogate host may be screened for the presence and/or 

expression of the transgene by any suitable method. Screening is often accomplished by 
Southern blot or Northern blot analysis, using a probe that is complementary to at least a 
portion of the transgene. Western blot analysis using an antibody against the protein 
encoded by the transgene may be employed as an alternative or additional method for 

25 screening for the presence of the transgene product Typically, DNA is prepared from tail 
tissue and analyzed by Southern analysis or PGR for the transgene. Alternatively, the 
tissues or cells believed to express tiie transgene at the highest levels are tested for tiie 
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presence and expression of the transgene using Southern analysis or PGR, although any 
tissues or ceil types may be used for this analysis. 

Alternative or additional methods for evaluating the presence of the transgene 
include, without limitation, suitable biochemical assays such as enzyme and/or 
5 immunological assays, histological stains for particular marker or enzyme activities, flow 
cytometric analysis, and the like. Analysis of the blood may also be useful to detect the 
presence of the transgene product in the blood, as well as to evaluate the effect of the 
transgene on the levels of various types of blood cells and other blood constituents. 

Progeny of the transgenic animals may be obtained by mating the transgenic 
1 0 animal with a suitable partner, or by in vitro fertilization of eggs and/or spemi obtained 
from the transgenic animal. Where mating with a partner is to be performed, the partner 
may or may not be transgenic and/or a knockout; where it is transgenic, it may contain the 
same or a different transgene, or both. Alternatively, the partner may be a parental line. 
Where in vitro fertilization is used, the fertilized embryo may be implanted into a 
15 surrogate host or incubated in vitro, or both. Using either method, the progeny may be 
evaluated for the presence of the transgene using methods described above, or other 
appropriate methods. 

The transgenic animals produced in accordance with the present invention will 
include exogenous genetic material. As set out above, the exogenous genetic material will, 
20 in certain embodiments, be a DNA sequence which results in the production of a Tarzan 
GPCR receptor. Further, in such embodiments the sequence wall be attached to a 
transcriptional control element, e.g., a promoter, which preferably allows the expression of 
the transgene product in a specific type of cell. 

Retroviral infection can also be used to introduce transgene into a non-human 
25 animal. The developing non-hiunan embryo can be cultured in vitro to the blastocyst stage. 
During this time, the blastomeres can be targets for retroviral infection (Jaenich, R. (1976) 
PNAS 73:1260-1264). Efficient infection of the blastomeres is obtained by enzymatic 
treatment to remove the zona pellucida (Manipulating the Mouse Embryo, Hogan eds. 
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(Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1986). The viral vector 
system used to introduce the transgene is typically a replication-defective retrovirus 
carrying the transgene (Jahner et al. (1985) PNAS 82:6927-6931; Van der Putten et al. 
(1985) PNAS 82:6148-6152). Transfection is easily and efficiently obtained by culturing 
5 the blastomeres on a monolayer of virus-producing cells (Van der Putten, supra; Stewart et 
al. (1987) EMBO J. 6:383-388). Alternatively, infection can be performed at a later stage. 
Virus or virus-producing cells can be injected into the blastocoele (Jahner et al (1982) 
Nature 298:623-628). Most of the founders will be mosaic for the transgene since 
incorporation occurs only in a subset of the cells which formed the transgenic non-human 
10 animal. Further, the founder may contain various retroviral insertions of the transgene at 
different positions in the genome which generally will segregate in the offspring. In 
addition, it is also possible to introduce transgenes into the germ Ime by intrauterine 
retroviral infection of the midgestation embryo (Jahner et al. (1982) supra). 

A third type of target cell for transgene introduction is the embryonal stem cell 
1 5 (ES). ES cells are obtained from pre-implantation embryos cultured in vitro and fused with 
embryos (Evans et al. (1981) Nature 292:154-156; Bradley et al. (1984) Nature 309:255- 
258; Gossler et al. (1986) PNAS 83: 9065-9069; and Robertson et al. (1986) Nature 
322:445-448). Transgenes can be efficiently introduced into the ES cells by DNA 
transfection or by retrovirus-mediated transduction. Such transformed ES cells can 
20 thereafter be combined with blastocysts from a non-human animal. The ES cells thereafter 
colonize the embryo and contribute to the germ line of the resulting chimeric animal. For 
review see Jaenisch, R. (1988) Science 240:1468-1474. 

We also provide non-human transgenic animals, where the transgenic animal is 
characterized by having an altered Tarzan gene, preferably as described above, as models 

25 for Tarzan receptor function. Alterations to the gene include deletions or other loss of 
Amotion mutations, introduction of an exogenous gene having a nucleotide sequence with 
targeted or random mutations, introduction of an exogenous gene from another species, or 
a combination thereof. The transgenic animals may be either homozygous or heterozygous 
for the alteration. The animals and cells derived therefrom are useful for screening 

30 biologically active agents that may modulate Tarzanreceptor fimction. The screening 
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methods are of particular use for determining the specificity and action of potential 
therapies for infections such as bacterial, fungal, protozoan and viral infections, 
particularly infections caused by HIV-l or HIV-2; pain; cancers; diabetes, obesity; 
anorexia; bulimia; asthma; parkinson's disease; thrombosis; acute heart failure; 
5 hypotension; hypertension; erectile dysfunction; urinary retention; metabolic bone 
diseases such as osteoporisis and osteo petrosis; angina pectoris; myocardial infarction; 
ulcers; asthma; allergies; rheumatoid arthritis; inflammatory bowel disease; irritable bowel 
syndrome benign prostatic hypertrophy; and psychotic and neurological disorders, 
mcluding anxiety,.schizophrehia, manic depression, delirium, dementia, severe mental 
1 0 retardation and dyskinesias, such as Huntington's disease or Gilles dela Tourett*s 

syndrome. The animals are useful as a model to investigate the role of Tarzan receptors in 
normal brain, heart, spleen and liver function. 

Another aspect of the invention pertains to a transgenic nonhuman animal having a 
fimctionally disrupted endogenous Tarzan gene but which also carries in its genome, and 

1 5 expresses, a transgene encoding a heterologous Tarzan protein (i.e., a Tarzan from another 
species). Preferably, the animal is a mouse and the heterologous Tarzan is a human 
Tarzan. An animal, or cell lines derived from such an animal of the invention, which has 
been reconstituted with human Tarzan, can be used to identify agents that inhibit human 
Tarzan in vivo and in vitro. For example, a stimulus that induces signalling through human 

20 Tarzan can be administered to the animal, or cell line, in the presence and absence of an 
agent to be tested and the response in the animal, or cell line, can be measured. An agent 
that inhibits human Tarzan in vivo or in vitro can be identified based upon a decreased 
response in the presence of the agent compared to the response in the absence of the agent. 

The present invention also provides for a Tarzan GPCR deficient transgenic non- 
25 human animal (a "Tarzan GPCR knock-out")- Such an animal is one which expresses 
lowered or no Tarzan GPCR activity, preferably as a result of an endogenous Tarzan 
GPCR genomic sequence being disrupted or deleted. The endogenous Tarzan GPCR 
genomic sequence may be replaced by a null allele, which may comprise non-functional 
portions of the wild-type Tarzan sequence. For example, the endogenous Tarzan GPCR 
30 genomic sequence may be replaced by an allele of Tarzan comprising a disrupting 
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sequence which may comprise heterologous sequences, for example, reporter sequences 
and/or selectable markers. Preferably, the endogenous Tarzan GPCR genomic sequence in 
a Tarzan knock-out mouse is replaced by an allele of Tarzan in which one or more, 
preferably all, of the transmembrane sequences is replaced by such a disrupting sequence, 
5 preferably a lacZ sequence and a neomycin resistance sequence. Preferably, the genomic 
Tarzan sequence which is functionally disrupted comprises a mouse Tarzan genomic 
sequence shown in SEQ ID NO: 10. 

Preferably, such an animal expresses no GPCR activity. More preferably, the 
animal expresses no activity of the Tarzan GPCR shown as SEQ ID NO: 3, SEQ ID NO: 
10 5, SEQ ID NO: 6 or SEQ ID NO: 8. Tarzan GPCR knock-outs may be generated by 

various means known in the art, as described in further detail below. A specific description 
of the construction of a Tarzan knock-out mouse is disclosed in Examples 1 et seq. 

The present invention also pertains to a nucleic acid construct for functionally 
disrupting a Tarzan gene in a host celL The nucleic acid construct comprises: a) a non- 
1 5 homologous replacement portion; b) a first homology region located upstream of the non- 
homologous replacement portion, the first homology region having a nucleotide sequence 
with substantial identity to a first Tarzan gene sequence; and c) a second homology region 
located downstream of the non-homologous replacement portion, the second homology 
region having a nucleotide sequence with substantial identity to a second Tarzan gene 
20 sequence, the second Tarzan gene sequence having a location downstream of the first 

Tarzan gene sequence in a naturally occurring endogenous Tarzan gene. Additionally, the 
first and second homology regions are of sufficient length for homologous recombination 
between the nucleic acid construct and an endogenous Tarzan gene in a host cell when the 
nucleic acid molecule is introduced into the host cell. In a preferred embodiment, the non- 
25 homologous replacement portion comprises an expression reporter, preferably including 
lacZ and a positive selection expreission cassette, preferably including a neomycin 
phosphotransferase gene operatively linked to a regulatory element(s). 
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Preferably, the first and second Tarzan gene sequences are derived from SEQ ID 
NO: 1, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 7 or SEQ ID NO: 10, or a 
homologue, variant or derivative thereof. 



Another aspect of the invention pertains to recombinant vectors into which the 
5 nucleic acid construct of the invention has been incorporated. Yet another aspect of the 
invention pertains to host cells into which the nucleic acid construct of the invention has 
been introduced to thereby allow homologous recombination between the nucleic acid 
construct and an endogenous Tarzan gene of the host cell, resulting in functional 
disruption of the endogenous Tarzan gene. The host cell can be a mammalian cell that 

1 0 normally expresses Tarzan from the liver, brain, spleen or heart, or a pluripotent cell, such 
as a mouse embryonic stem cell. Further development of an embryonic stem cell into 
which the nucleic acid construct has been introduced and homologously recombined with 
the endogenous Tarzan gene produces a transgenic nonhimian animal having cells that are 
descendant from the embryonic stem cell and thus carry the Tarzan gene disruption in 

15 their genome. Animals that carry the Tarzan gene disruption in their germline can then be 
selected and bred to produce animals having the Tarzan gene disruption in all somatic and 
germ cells. Such mice can then be bred to homozygosity for the Tarzan gene disruption. 

A TARZAN GPCR deficient transgenic animal may be generated as follows: 

Construction of TARZAN Gene Targeting Vector 



20 Murine TARZAN genomic clones are isolated from a mouse large insert PAC 

library obtained from HGMP (Hinxton, UK) using a probe sequence amplified from a part 
of the predicted murine open reading firame cDNA sequence (SEQ ID NO: 4), using 
standard techniques. The isolated murine TARZAN genomic clones are then restriction 
mapped in the region of the TARZAN gene using small oligonucleotide probes and 

25 standard techniques. 



The murine genomic locus is partially sequenced to enable the design of 
homologous arms to clone into the targeting vector. Two regions of DNA, typically 
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between 1 and Skb in size, from either side of the region of the open reading frame to be 
deleted, called the 5' and 3' homology arms, are amplified by PCR and the fragments are 
cloned into the targeting vector. The position of these arms is chosen so that a homologous 
recombination event will functionally disrupt the TARZAN gene by deleting at least the 
5 seven trans-membrane spanning regions. A targeting vector is prepared where the deleted 
TARZAN sequence is replaced with non-homologous sequences composed of an 
endogenous gene expression reporter (a frame-independent lacZ gene) upstream of a 
selection cassette composed of a promoted neomycin phosphotransferase (neo) gene, 
arranged in the same orientation as the TAR21AN gene. 

1 0 The mouse genomic region used in the knockout strategy for the Tarzan gene is 

depicted as SEQ ID NO: 10 

Transfection and Analysis of Embryonal Stem Cells 

Embryonal stem cells (Evans and Kaufman, 1981) are cultured on a neomycin 
resistant embryonal fibroblast feeder layer grown in Dulbecco's Modified Eagles medium 

15 supplemented with 20% Fetal Calf Serum, 10% new-bom calf serum, 2 mM glutamine, 
non-essential amino acids, 100 jxM 2-mercaptoethanol and 500 u/ml leukemia inhibitory 
factor. Medium is changed daily and ES cells are subcultured every three days. 5x10^ ES 
cells are transfected with 5 ^ig of linearized plasmid by electroporation (25 ^iF capacitance 
and 400 Volts). 24 hours following electroporation the transfected cells are cultured for 9 

20 days in medium containing 200 fig/ml neomycin. Clones are picked into 96 well plates, 
replicated and expanded before being screened by PCR to identify clones in which 
homologous recombination had occurred between the endogenous Tarzan gene and the 
targeting construct. Positive clones are typically identified at a rate of 1 to 5%. These 
clones where expanded to allow replicas to be frozen and sufficient high quality DNA to 

25 be prepared for Southern blot confirmation of the targeting event using external 5' and 3 ' 
probes, all using standard procedures (Russ et al, 2000, Nature 2000 Mar 2; 404(6773):95- 
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Generation ofTarzan GPCR Deficient Mice 

C57BL/6 female and male mice are mated and blastocysts are isolated at 3.5 days 
of gestation. 10-12 cells from a chosen clone are injected per blastocyst and 7-8 
blastocysts are implanted in the uterus of a pseudopregnant Fl female. 

5 A litter of chimeric pups are bom containing several high level (up to 100%) 

agouti males (the agouti coat colour indicates the contribution of cells descendent from the 
targeted clone). The male chimeras are mated with female and MFl and 129 mice, and 
germline transmission is determined by the agouti coat color and by PGR genotyping 
respectively. 

10 Antibodies 

The present mvention further provides for antibodies which bind to a Tarzan 
polypeptide, fragment, homologue, variant or derivative thereof. Particularly Tarzan 
expression, and in particular in diagnosing a Tarzan GPCR associated disease. Other 
preferred antibodies include those which have therapeutic activity, i.e., which are may be 
1 5 used in a therapeutic manner to treat, manage or prevent any Tarzan GPCR associated 
disease. 

For the purposes of this invention, the term "antibody", unless specified to the 
contrary, includes but is not limited to, polyclonal, monoclonal, chimeric, single chain. Fab 
fragments and fragments produced by a Fab expression library. Such fragments include 

20 fragments of whole antibodies which retain their binding activity for a target substance, Fv, 
F(ab') and F(ab')2 fragments, as well as single chain antibodies (scFv), fusion proteins and 
other synthetic proteins which comprise the antigen-binding site of the antibody. The 
antibodies and fragments thereof may be humanised antibodies, for example as described in 
EP-A-239400. Furthermore, antibodies with fiilly human variable regions (or their 

25 fragments), for example, as described in US Patent Nos. 5,545,807 and 6,075,181 may also 
be used. Neutralizing antibodies, i.e., those which inhibit biological activity of the 
substance amino acid sequences, are especially preferred for diagnostics and therapeutics. 
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Antibodies may be produced by standard techniques, such as by immunisation or 
by using a phage display library. 

A polypeptide or peptide of the present invention may be used to develop an 
antibody by known techniques. Such an antibody may be capable of binding specifically 
5 to the Tarzan GPCR protein or homologue, fragment, etc. 

If polyclonal antibodies are desired, a selected mammal (e.g., mouse, rabbit, goat, 
horse, etc.) may be immunised with an immunogenic composition comprising a 
polypeptide or peptide of the present mvention. Depending on the host species, various 
adjuvants may be used to increase inmiunological response. Such adjuvants include, but 

10 are not limited to, Freund's, mineral gels such as aluminium hydroxide, and surface active 
substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 
keyhole limpet hemocyanin, and dinitrophenol. BCG (Bacilli Calmette-Guerin) and 
Corynebacterium parvum are potentially useful human adjuvants which may be employed 
if purified the substance amino acid sequence is administered to immunologically 

15 compromised individuals for the purpose of stimulating systemic defence. 

Serum from the immunised animal is collected and treated according to known 
procedures. If serum containing polyclonal antibodies to an epitope obtainable from a 
polypeptide of the present invention contains antibodies to other antigens, the polyclonal 
antibodies can be purified by immunoaffinity chromatography. Techniques for producing 
20 and processing polyclonal antisera are known in the art. In order that such antibodies may 
be made, the invention also provides amino acid sequences of the invention or fragments 
thereof haptenised to another amino acid sequence for use as immunogens in animals or 
humans. 

Monoclonal antibodies directed against epitopes obtamable from a polypeptide or 
25 peptide of the present invention can also be readily produced by one skilled in the art. The 
general methodology for making monoclonal antibodies by hybridomas is well known. 
Inmiortal antibody-prodiicing cell lines can be created by cell fusion, and also by other 
techniques such as direct transformation of B lymphocytes with oncogenic DNA, or 



wo 02/0S8183 



PCT/GB02/01923 



61 

transfection with Epstein-Barr virus. Panels of monoclonal antibodies produced against 
orbit epitopes can be screened for various properties; i.e., for isotype and epitope affinity. 

Monoclonal antibpdies may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, 
5 but are not limited to, the hybridoma technique originally described by Koehler and 
Milstein (1975 Nature 256:495-497), the trioma technique, the human B-cell hybridoma 
technique (Kosbor et al (\ 983) Immunol Today 4:72; Cote et al (1 983) Proc Natl Acad Sci 
80:2026-2030) and the EBV-hybridoma technique (Cole et al.. Monoclonal Antibodies 
and Cancer Therapy, pp. 77-96, Alan R. Liss, Inc., 1985). 

10 In addition, techniques developed for the production of" ^chimeric antibodies", the 

splicing of mouse antibody genes to human antibody genes to obtain a molecule with 
appropriate antigen specificity and biological activity can be used (Morrison et al (1984) 
Proc Natl Acad Sci 81:6851-6855; Neuberger et al (1984) Nature 312:604-608; Takeda et 
al (1985) Nature 3 14:452-454). Alternatively, techniques described for the production of 

1 5 single chain antibodies (US Patent No. 4,946,779) can be adapted to produce the substance 
specific single chain antibodies. 

Antibodies, both monoclonal and polyclonal, which are directed against epitopes 
obtainable from a polypeptide or peptide of the present invention are particularly useful in 
diagnosis, and those which are neutralising are useful in passive immunotherapy. 
20 Monoclonal antibodies, in particular, may be used to raise anti-idiotype antibodies. Anti- 
idiotype antibodies are inmiimoglobulins which carry an "internal image" of the substance 
and/or agent against which protection is desired. Techniques for raising anti-idiotype 
antibodies are known in the art. These anti-idiotype antibodies may also be useful in 
therapy. 

25 Antibodies may also be produced by inducing in vivo production in the lymphocyte 

population or by screening recombinant immunoglobulin libraries or panels of highly 
specific binding reagents as disclosed in Oriandi et al (1989, Proc Natl Acad Sci 86: 3833- 
3837), and Winter G and Milstein C (1991; Nature 349:293-299). 
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Antibody fragments which contain specific binding sites for the polypeptide or 
peptide may also be generated. For example, such fragments include, but are not limited 
to, the F(ab')2 fragments which can be produced by pepsin digestion of the antibody 
molecule and the Fab fragments which can be generated by reducing the disulfide bridges 
5 of the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to 
allow rapid and easy identification of monoclonal Fab fragments with the desired 
specificity (Huse WD et al (1989) Science 256:1275-128 1). 

Techniques for the production of single chain antibodies (U.S. Pat. No. 4,946,778) 
can also be adapted to produce single chain antibodies to polypeptides of this invention. 
10 Also, transgenic mice, or other organisms including other mammals, may be used to 
express humanized antibodies. 

The above-described antibodies may be employed to isolate or to identify clones 
expressing the polypeptide or to purify the polypeptides by afiSnity chromatography. 

Antibodies against Tarzan GPCR polypeptides may also be employed to treat 
15 infections such as bacterial, fungal, protozoan and viral infections, particularly infections 
caused by HIV-1 or HIV-2; pain; cancers; diabetes, obesity; anorexia; bulimia; asthma; 
Parkinson's disease; thrombosis; acute heart failure; hypotension; hypertension; erectile 
dysfunction; urinary retention; metabolic bone diseases such as osteoporisis and osteo 
petrosis; angina pectoris; myocardial infarction; ulcers; asthma; allergies; rheimiatoid 
20 arthritis; inflannunatory bowel disease; irritable bowel syndrome benign prostatic 

hypertrophy; and psychotic and neurological disorders, including anxiety, schizophrenia, ^ 
manic depression, delirium, dementia, severe mental retardation and dyskinesias, such as 
Huntington's disease or Gilles dela Tourett's syndrome 

Diagnostic Assays 

25 This invention also relates to the use of Tarzan GPCR polynucleotides and 

polypeptides (as well as homologues, variants and derivatives thereof) for use in diagnosis 
as diagnostic reagents or in genetic analysis. Nucleic acids complementary to or capable of 
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hybridising to Tarzan GPCR nucleic acids (including homologues, variants and 
derivatives), as well as antibodies against Tarzan polypeptides are also useful in such 
assays. 

We provide for a natural variant of Tarzan polypeptide or nucleic acid, and the use 
5 of such a natural variant in diagnosis of Tarzan associated disease. Tarzan polymorphisms 
may include differences at the nucleic acid level, which may or may not reflect differences 
in the amino acid level. Preferably, such Tarzan variants or mutants are such that they 
include changes in the amino acid level. However, the invention also encompasses Tarzan 
polymorphisms which occur in non-coding regions, for example, expression control 
1 0 regions such as promoters and enhancers. 

Polymorphisms in Tarzan include deletions of one or more nucleic acids, insertions 
of one or more nucleic acids, inversions, etc. Preferably, Tarzan polymorphisms comprise 
single nucleotide polymorphisms. 

Polymorphisms in Tarzan may be identified by comparing sequences at the 
1 5 appropriate level (whether nucleic acid or protein) between individuals in a population. 
Differences in sequences may be reflected in different physical properties, and techniques 
for detecting these may rely on detection of changes in physical properties. For example, 
single nucleotide polymorphisms may be detected as restriction fragment length 
polymorphisms (i.e., difference in susceptiability to digestion by a restriction enzyme). 
20 Furthermore, SNPs may affect the migration or mobility of a nucleic acid fragment or 
protein fragment in a gel. 

Non-coding polymorphisms in Tarzan may be identified by sequencing non-coding 
regions of Tarzan. For example, control regions of the Tarzan gene, such as enhancers and 
promoters may be sequenced to identify polymorphisms. The effect of such non-coding 
25 polymorphisms on the expression level of Tarzan may be determined by constructing 
transgenic mice (as described below) comprising the mutant Tarzan sequences, or by 
generating expression constructs and transfection into cell lines. In each case, the 
expression level of Tarzan is detected, by RT-PCR or antibody Western staining, to 
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determine the effect of the mutation in the control of expression of Tarzan. Useful Tarzan 
polymorphisms are those which modulate the level of expression, wiether by up-regulation 
or down-regulation of Tarzan levels. 

Accordingly, this invention provides for a variant or mutant or polymorphism in a 
5 non-coding region of Tarzan, preferably in a control region of Tarzan, preferably in a 
promoter and/or enhancer of Tarzan, which is capable of modulating the level of 
expression of Tarzan in an organism. The invention also provides for a set of two or more 
of such mutants or variants or polymorphisms, preferably non-coding polymorphisms. The 
invention also provides for the use of such variants or polymorphisms or sets of variants to 
1 0 identify nucleic acid and/or amino acid positions, in which changes to such positions 
affect the level of expression of Tarzan. The invention also provides for a transgenic 
animal comprising a variant or mutant or polymorphism of Tarzan, preferably, a non- 
coding polymorphism. 

Detection of a mutated form of the Tarzan GPCR gene associated with a 
1 S dysfunction will provide a diagnostic tool that can add to or define a diagnosis of a disease 
or susceptibility to a disease which results from under-expression, over-expression or 
altered expression of Tarzan GPCR. Individuals carrying mutations in the Tarzan GPCR 
gene (including control sequences) may be detected at the DNA level by a variety of 
techniques. 

20 For example, DNA may be isolated from a patient and the DNA polymorphism 

pattern of Tarzan determined. The identified pattem is compared to controls of patients 
known to be suffering from a disease associated with over-, under- or abnormal expression 
of Tarzan. Patients expressing a genetic polymorphism pattem associated with Tarzan 
associated disease may then be identified. Genetic analysis of the Tarzan GPCR gene may 

25 be conducted by any technique known in the art. For example, individuals may be 

screened by determining DNA sequence of a Tarzan allele, by RFLP or SNP analysis, etc. 
Patients may be identified as having a genetic predisposition for a disease associated with 
the over-, under-, or abnormal expression of Tarzan by detecting the presence of a DNA 
polymorphism in the gene sequence for Tarzan or any sequence controlling its expression. 
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Patients so identified can then be treated to prevent the occurrence of Tarzan 
associated disease, or more aggressively in the early stages of Tarzan associated disease to 
prevent the further occurrence or development of the disease. Tarzan associated diseases 
include any one of trigeminal neuralgia, orofacial pain, pain associated with toothache, 
5 irritable bowel syndrome, Barrett's oesophagus, glaucoma, pain associated with cancer, 
diabetic neuropathies, Herpes infections, HIV infections, migraine and skin sensitivity 
associated with migraine, allodynia, toothache, neuroma (whether caused by amputation, 
nerve transaction or traimia), nerve compression (caused by tumours, entrapment or 
crush), pain due to damage of the spinal cord or brain; dementia, dyslexia, dyskinesias, 
10 tremor, Parkinson's, benign essential tremor, chorea, epilepsy and ballismus, for example 
occurring through stroke, trauma, degeneration or malignancy; cystic fibrosis and 
hyperactive bladder. 

The present invention further discloses a kit for the identification of a patient's 
genetic polymorphism pattern associated with Tarzan associated disease. The kit includes 
1 5 DNA sample collecting means and means for detennining a genetic polymorphism 

pattern, which is then compared to control samples to determine a patient's susceptibility 
to Tarzan associated disease. Kits for diagnosis of a Tarzan associated disease comprising 
Tarzan polypeptide and/or an antibody against such a polypeptide (or fi-agment of it) are 
also provided. 

20 Nucleic acids for diagnosis may be obtained fi-om a subject's cells, such as firom 

blood, urine, saliva, tissue biopsy or autopsy material. In a preferred embodiment, the 
DNA is obtained firom blood cells obtained fi-om a finger prick of the patient with the 
blood collected on absorbent paper In a fiirther preferred embodiment, the blood is 
collected on an AmpliCard.TM. (University of Sheffield, Department of Medicine and 

25 Pharmacology, Royal Hallamshire Hospital, Sheffield, England S 1 0 2JF). 

The DNA may be used directly for detection or may be amplified enzymatically by 
using PGR or other amplification techniques prior to analysis. Oligonucleotide DNA 
primers that target the specific polymorphic DNA region within the genes of interest may 
be prepared so that in the PGR reaction amplification of the target sequences is achieved. 



wo 02/088183 



PCT/GB02/01923 



66 

RNA or cDNA may also be used as templates in similar fashion. The amplified DNA 
sequences from the template DNA may then be analyzed using restriction enzymes to 
determine the genetic polymorphisms present in the amplified sequences and thereby 
provide a genetic polymorphism profile of the patient. Restriction fragments lengths may 
5 be identified by gel analysis. Alternatively, or in conjunction, techniques such as SNP 
(single nucleotide polymorphisms) analysis may be employed. 

Deletions and insertions can be detected by a change in size of the amplified 
product in comparison to the normal genotype. Point mutations can be identified by 
hybridizing amplified DNA to labeled Tarzan GPCR nucleotide sequences. Perfectly 

10 matched sequences can be distinguished from mismatched duplexes by RNase digestion or 
by differences in meltmg temperatures. DNA sequence differences may also be detected 
by alterations in electrophoretic mobility of DNA fragments in gels, with or without 
denaturing agents, or by direct DNA sequencing. See, eg,, Myers et al. Science 
(1985)230:1242. Sequence changes at specific locations may also be revealed by nuclease 

1 5 protection assays, such as RNase and SI protection or the chemical cleavage method. See 
Cotton et al., Proc Natl Acad Sci USA (1985) 85: 4397-4401. In another embodiment, an 
array of oligonucleotides probes comprising the Tarzan GPCR nucleotide sequence or 
fragments thereof can be constructed to conduct efficient screening of e.g., genetic 
mutations. Array technology methods are well known and have general applicability and 

20 can be used to address a variety of questions in molecular genetics including gene 
expression, genetic linkage, and genetic variability. (See for example: M.Chee et al.. 
Science, Vol 274, pp 610-613 (1 996)). 

Single strand conformation polymorphism (SSCP) may be used to detect 
differences in electrophoretic mobility between mutant and wild type nucleic acids (Orita 

25 et al. (1989) Proc Natl. Acad. Sci USA: 86:2766, see also Cotton (1993) Mutat Res 

285:125-144; and Hayashi (1992) Genet Anal Tech Appl 9:73-79). Single-stranded DNA 
Augments of sample and control Tarzan nucleic acids may be denatured and allowed to 
renature. The secondary structure of single-stranded nucleic acids varies according to 
sequence, the resulting alteration in electrophoretic mobility enables the detection of even 

30 a single base change. The DNA firagments may be labeled or detected wdth labeled probes. 
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The sensitivity of the assay may be enhanced by using RNA (rather than DNA), in which 
the secondary structure is more sensitive to a change in sequence. In a preferred 
embodiment, the subject method utilizes heteroduplex analysis to separate double stranded 
heteroduplex molecules on the basis of changes in electrophoretic mobility (Keen et al. 
5 (1991) Trends Genet 7:5). 

The diagnostic assays offer a process for diagnosing or determining a susceptibility 
to Tarzan associated diseases, for example, infections such as bacterial, fungal, protozoan 
and viral infections, particularly infections caused by HTV-l or HrV-2; pain; cancers; 
diabetes, obesity; anorexia; bulimia; asthma; parkinson's disease; thrombosis; acute heart 
failure; hypotension; hypertension; erectile dysfunction; urinary retention; metabolic bone 
diseases such as osteoporisis and osteo petrosis; angina pectoris; myocardial infarction; 
ulcers; asthma; allergies; rhexmiatoid arthritis; inflammatory bowel disease; irritable bowel 
syndrome benign prostatic hypertrophy; and psychotic and neurological disorders, 
including anxiety, schizophrenia, manic depression, delirium, dementia, severe mental 
retardation and dyskinesias, such as Himtington's disease or Gilles dela Tourett's 
syndrome. 

The presence of Tarzan GPCR polypeptides and nucleic acids may be detected in a 
sample. Thus, infections and diseases as listed above can be diagnosed by methods 
comprising determining from a sample derived from a subject an abnormally decreased or 
20 increased level of the Tarzan GPCR polypeptide or Tarzan GPCR mRNA. The sample 
may comprise a cell or tissue sample from an organism suffering or suspected to be 
suffering from a disease associated with increased, reduced or otherwise abnormal Tarzan 
GPCR expression, including spatial or temporal changes in level or pattem of expression. 
The level or pattem of expression of Tarzan in an organism suffering from or suspected to 
25 be suffering from such a disease may be usefully compared with the level or pattem of 
expression in a normal organism as a means of diagnosis of disease. 

In general therefore, the invention includes a method of detecting the presence of a 
nucleic acid comprising a Tarzan GPCR nucleic acid in a sample, by contacting the 
sample with at least one nucleic acid probe which is specific for said nucleic acid and 



10 



15 
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monitoring said sample for the presence of the nucleic acid. For example, the nucleic acid 
probe may specifically bind to the Tarzan GPCR nucleic acid, or a portion of it, and 
binding between the two detected; the presence of the complex itself may also be detected. 
Furthermore, the invention encompasses a method of detecting the presence of a Tarzan 
5 GPCR polypeptide by contacting a cell sample with an antibody capable of bindmg the 
polypeptide and monitoring said sample for the presence of the polypeptide. This may 
conveniently be achieved by monitoring the presence of a complex formed between the 
antibody and the polypeptide, or monitoring the binding between the polypeptide and the 
antibody. Methods of detecting binding between two entities are known in the art, and 
10 include FRET (fluorescence resonance energy transfer), surface plasmon resonance, etc. 



Decreased or increased expression can be measured at the RNA level using any of 
the methods well known in the art for the quantitation of polynucleotides, such as, for 
example, PCR, RT-PCR, RNase protection. Northern blotting and other hybridization 
methods. Assay techniques that can be used to determine levels of a protein, such as a 
1 5 Tarzan GPCR, in a sample derived from a host are well-known to those of skill in the art. 
Such assay methods include radioimmunoassays, competitive-binding assays, Western 
Blot analysis and ELISA assays. 



The present invention relates to a diagnostic kit for a disease or susceptibility to a 
Tarzan associated disease (including an infection), for example, infections such as 

20 bacterial, fungal, protozoan and viral infections, particularly infections caused by HIV-1 
or HrV-2; pain; cancers; diabetes, obesity; anorexia; bulimia; asthma; parkinson's disease; 
tiirombosis; acute heart failure; hypotension; hypertension; erectile dysfunction; urinary 
retention; metabolic bone diseases such as osteoporisis and osteo petrosis; angina pectoris; 
myocardial infarction; ulcers; asthma; allergies; rheumatoid arthritis; inflammatory bowel 

25 disease; irritable bowel syndrome benign prostatic hypertrophy; and psychotic and 
neurological disorders, mcluding anxiety, schizophrenia, manic depression, delirium, 
dementia, severe mental retardation and dyskinesias, such as Huntington's disease or 
Gilles dela Tourett's syndrome. 
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The diagnostic kit comprises a Tarzan GPCR polynucleotide or a fragment thereof; 
a complementary nucleotide sequence; a Tarzan GPCR polypeptide or a fragment thereof, 
or an antibody to a Tarzan GPCR polypeptide. 

Chromosome Assays 

The nucleotide sequences of the present invention are also valuable for 
chromosome identification. The sequence is specifically targeted to and can hybridize with 
a particular location on an individual human chromosome. As described above, human 
Tarzan GPCR is found to map to homo sapiens chromosome 13q32.2. 

The mapping of relevant sequences to chromosomes according to the present 
invention is an important first step in correlating those sequences with gene associated 
disease. Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with genetic map 
data. Such data are foimd, for ^example, in V. McKusick, Mendelian heritance in Man 
(available on line through Johns Hopkins University Welch Medical Library). The 
relationship between genes and diseases that have been mapped to the same chromosomal 
region are then identified through linkage analysis (coinheritance of physically adjacent 
genes). 

The differences in the cDNA or genomic sequence between affected and 
unaffected individuals can also be determined. If a mutation is observed in some or all of 
the affected individuals but not in any normal individuals, then the mutation is likely to be 
the causative agent of the disease. 

Prophylactic and Therapeutic Methods 

This invention provides methods of treating an abnormal conditions related to both 
an excess of and insufficient amounts of Tarzan GPCR activity. 
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If the activity of Tarzan GPCR is in excess, several approaches are available. One 
approach comprises administering to a subject an inhibitor compound (antagonist) as 
hereinabove described along with a pharmaceutically acceptable carrier in an amount 
effective to inhibit activation by blocking binding of ligands to the Tarzan GPCR, or by 
5 inhibiting a second signal, and thereby alleviating the abnormal condition. 

In another approach, soluble forms of Tarzan GPCR polypeptides still capable of 
binding the ligand in competition with endogenous Tarzan GPCR may be administered. 
Typical embodiments of such competitors comprise fragments of the Tarzan GPCR 
polypeptide. 

10 In still another approach, expression of the gene encoding endogenous Tarzan 

GPCR can be inhibited using expression blocking techniques. Known such techniques 
involve the use of antisense sequences, either internally generated or separately 
administered. See, for example, O^Coimot.J Neurochem (1991) 56:560 in 
Oligodeoxvnucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 

1 5 Raton, Fla. (1988). Alternatively, oligonucleotides which form triple helices with the gene 
can be supplied. See, for example, Lee et al., Nucleic Acids Res (1979) 6:3073; Cooney et 
al.. Science (1988) 241:456; Dervan et al., Science (1991) 251:1360. These oligomers can 
be administered per se or the relevant oligomers can be expressed in vivo. 

For treating abnormal conditions related to an under-expression of Tarzan GPCR 
20 and its activity, several approaches are also available. One approach comprises 

administering to a subject a therapeutically effective amount of a compound which 
activates Tarzan GPCR, i.e., an agonist as described above, in combination with a 
pharmaceutically acceptable carrier, to thereby alleviate the abnormal condition. 
Alternatively, gene therapy may be employed to effect the endogenous production of 
25 Tarzan GPCR by the relevant cells in the subject. For example, a polynucleotide of the 
invention may be engineered for expression in a replication defective retroviral vector, as 
discussed above. The retroviral expression construct may then be isolated and introduced 
into a packaging cell transduced with a retroviral plasmid vector containing RNA 
encoding a polypeptide of the present invention such that the packaging cell now produces 
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infectious viral particles containing the gene of interest. These producer cells may be 
administered to a subject for engineering cells in vivo and expression of the polypeptide in 
vivo. For overview of gene therapy,, see Chapter 20, Gene Therapy and other Molecular 
Genetic-based Therapeutic Approaches, (and references cited therein) in Human 
5 Molecular Genetics, T Strachan and A P Read, BIOS Scientific Publishers Ltd (1 996). 

Formulation and Administration 

Peptides, such as the soluble form of Tarzan GPCR polypeptides, and agonists and 
antagonist peptides or small' molecules, may be formulated in combination with a suitable 
pharmaceutical carrier. Such formulations comprise a therapeutically effective amount of 

10 the polypeptide or compound, and a phaimaceutically acceptable carrier or excipient. Such 
carriers include but are not limited to, saline, buffered saline, dextrose, water, glycerol, 
ethanol, aiid combinations thereof. Formulation should suit the mode of administration, 
and is well within the skill of the art. The invention further relates to pharmaceutical packs , 
and kits comprising one or more containers filled with one or more of the ingredients of 

1 5 the aforementioned compositions of the invention. 

Polypeptides and other compounds of the present invention may be employed 
alone or in conjunction with other compounds, such as therapeutic compounds. 

Preferred forms of systemic administration of the pharmaceutical compositions 
include injection, typically by intravenous injection. Other injection routes, such as 

20 subcutaneous, intramuscular, or intraperitoneal, can be used. Alternative means for 
systemic administration include transmucosal and transdermal administration using 
penetrants such as bile salts or fusidic acids or other detergents. In addition, if properly 
formulated in enteric or encapsulated formulations, oral administration may also be 
possible. Administration of these compounds may also be topical and/or localize, in the 

25 form of salves, pastes, gels and the like. 

The dosage range required depends on the choice of peptide, the route of 
• administration, the nature of the formulation, the nature of the subject's condition, and the 
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judgment of the attending practitioner. Suitable dosages, however, are in the range of 0.1- 
100 tig/kg of subject. Wide variations in the needed dosage, however, are to be expected 
in view of the variety of compounds available and the differing efficiencies of various 
routes of administration. For example, oral administration would be expected to require 
5 higher dosages than administration by intravenous injection. Variations in these dosage 
levels can be adjusted using standard empirical routines for optimization, as is well 
understood in the art. 

Polypeptides used in treatment can also be generated endogenously in the subject, 
in treatment modalities often referred to as "gene therapy" as described above. Thus, for 
10 example, cells from a subject may be engineered with a polynucleotide, such as a DNA or 
RNA, to encode a polypeptide ex vivo, and for example, by the use of a retroviral plasmid 
vector. The cells are then introduced into the subject 

Pharmaceutical Compositions 

The present invention also provides a pharmaceutical composition comprising 
1 5 administering a therapeutically effective amount of the polypeptide, polynucleotide, 
peptide, vector or antibody of the present invention and optionally a pharmaceutically 
acceptable carrier, diluent or excipients (including combinations thereof). 

The pharmaceutical compositions may be for human or animal usage in human and 
veterinary medicine and will typically comprise any one or more of a pharmaceutically 

20 acceptable diluent, carrier, or excipient. Acceptable carriers or diluents for therapeutic use 
are well known in the pharmaceutical art, and are described, for example, in Remington's 
Pharmaceutical Sciences, Mack Publishing Co. (A. R. Gennaro edit 1985). The choice of 
pharmaceutical carrier, excipient or diluent can be selected with regard to the intended 
route of administration and standard pharmaceutical practice. The pharmaceutical 

25 compositions may comprise as - or in addition to - the carrier, excipient or diluent any 

suitable binder(s), lubricant(s), suspending agent(s), coating agent(s), solubilising agent(s). 



wo 02/088183 



PCT/GB02/01923 



73 

Preservatives, stabilizers, dyes and even flavoring agents may be provided in the 
pharmaceutical composition. Examples of preservatives include sodium benzoate, sorbic 
acid and esters of p-hydroxybenzoic acid. Antioxidants and suspending agents may be also 
used. 

5 There may be different composition/formulation requirements dependent on the 

different delivery systems. By way of example, the pharmaceutical composition of the 
present invention may be formulated to be delivered using a a mini-pump or by a mucosal 
route, for example, as a nasial spray or aerosol for inhalation or ingestable solution, or 
paienterally in which the composition is formulated by an injectable form, for delivery, 
10 by, for example, an intravenous, intramuscular or subcutaneous route. Alternatively, the 
formulation may be designed to be delivered by both routes. 

Where the agent is to be delivered mucosally through the gastrointestinal mucosa, 
it should be able to remain stable during transit though the gastromtestinal tract; for 
example, it should be resistant to proteolytic degradation, stable at acid pH and resistant to 
15 the detergent effects of bile. 

Where appropriate, the pharmaceutical compositions can be administered by 
inhalation, in the form of a suppository or pessary, topically in the form of a lotion, 
solution, cream, ointment or dusting powder, by use of a skin patch, orally in the form of 
tablets containing excipients such as starch or lactose, or in capsules or ovules either alone 

20 or in admixture with excipients, or in the form of elixirs, solutions or suspensions 
containing flavouring or colouring agents, or they can be injected parenterally, for 
example intravenously, mtramuscularly or subcutaneously. For parenteral administration, 
the compositions may be best used in the form of a sterile aqueous solution which may 
contain other substances, for example enough salts or monosaccharides to make the 

25 solution isotonic with blood. For buccal or sublingual administration the compositions 
may be administered in flie form of tablets or lozenges which can be formulated in a 
conventional maimer. 
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Vaccines 

Another embodiment of the invention relates to a method for inducing an 
immunological response in a mammal which comprises inoculating the mammal with the 
Tarzan GPCR polypeptide, or a fragment thereof, adequate to produce antibody and/or T 
5 cell immune response to protect said animal from infections such as bacterial, fungal, 
protozoan and viral infections, particularly infections caused by HIV-1 or HIV-2; pain; 
cancers; diabetes, obesity; anorexia; bulimia; asthma; parkinson's disease; thrombosis; 
acute heart failure; hypotension; hypertension; erectile dysfunction; urinary retention; 
metabolic bone diseases such as osteoporisis and osteo petrosis; angina pectoris; 
10 myocardial infarction; ulcers; asthma; allergies; rheumatoid arthritis; inflammatory bowel 
disease; irritable bowel syndrome benign prostatic hypertrophy; and psychotic and 
neurological disorders, including anxiety, schizophrenia, manic depression, delirium, 
dementia, severe mental retardation and dyskinesias, such as Huntington's disease or 
Gilles dela Tourett's syndrome, among others. 

1 5 Yet another embodiment of the invention relates to a method of inducing 

immunological response in a mammal which comprises delivering a Tarzan GPCR 
polypeptide via a vector directing expression of a Tarzan GPCR polynucleotide in vivo in 
order to induce such an immunological response to produce antibody to protect said 
animal from diseases. 

20 A further embodiment of the invention relates to an immunological/vaccine 

formulation (composition) which, when introduced into a mammalian host, induces an 
immimological response in that mammal to a Tarzan GPCR polypeptide wherein the 
composition comprises a Tarzan GPCR polypeptide or Tarzan GPCR gene. The vaccine 
formulation may further comprise a suitable carrier. 

25 Since the Tarzan GPCR polypeptide may be broken down in the stomach, it is 

preferably administered parenterally (including subcutaneous, intram\iscular, intravenous, 
mtradermal etc. injection). Formulations suitable for parenteral administration include 
aqueous and non-aqueous sterile injection solutions which may contain anti-oxidants. 
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buffers, bacteriostats and solutes which render the formulation instonic with the blood of 
the recipient; and aqueous and non-aqueous sterile suspensions which may include 
suspending agents or thickening agents. The formulations may be presented in unit-dose or 
multi-dose containers, for example, sealed ampoules and vials and may be stored in a 
5 freeze-dried condition requiring only the addition of the sterile liquid carrier immediately 
prior to use. The vaccine formulation may also include adjuvant systems for enhancing the 
immunogenicity of the formulation, such as oil-in water systems and other systems known 
in the art. The dosage will depend on the specific activity of the vaccine and can be readily 
determined by routine experimentation. 

1 0 Vaccines may be prepared from one or more polypeptides or peptides of the 

present invention. 

The preparation of vaccines which contain an inmnmogenic polypeptide(s) or 
peptide(s) as active ingredient(s), is known to one skilled in the art. Typically, such 
vaccines are prepared as injectables, either as liquid solutions or suspensions; solid forms 

15 suitable for solution in, or suspension in, liquid prior to injection may also be prepared. 
The preparation may also be emulsified, or the protein encapsulated in liposomes. The 
active immunogenic ingredients are often mixed with excipients which are 
pharmaceutically acceptable and compatible with the active ingredient. Suitable excipients 
are, for example, water, saline, dextrose, glycerol, ethanol, or the like and combinations 

20 thereof 

In addition, if desired, the vaccme may contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH buffering agents, and/or adjuvants 
which enhance the effectiveness of the vaccine. Examples of adjuvants which may be 
effective include but are not limited to: aluminum hydroxide, N-acetyl-muramyl-L- 
25 threonyl-D-isoglutamine (thr-MDP), N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine 
(CGP 1 1637, referred to as nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L- 
alanine-2-(r-2'-dipahnitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine (CGP 
19835A, referred to as MTP-PE), and RIBI, which contains three components extracted 
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from bacteria, monophosphoryl lipid A, trehalose dimycolate and cell wall skeleton 
(MPL+TDM+CWS) in a 2% squalene/Tween 80 emulsion. 

Further examples of adjuvants and other agents include aluminum hydroxide, 
aluminum phosphate, aluminum potassium sulfate (alum), beryllium sulfate, silica, kaolin, 
carbon, water-in-oil emulsions, oil-in-water emulsions, muramyl dipeptide, bacterial 
endotoxin, lipid X, Corynebacterium parvum (Propionobacterium acnes), Bordetella 
pertussis, polyribonucleotides, sodium alginate, lanolin, lysolecithin, vitamin A, saponm, 
liposomes, levamisole, DEAE-dextran, blocked copolymers or other synthetic adjuvants. 
Such adjuvants are available commercially from various sources, for example, Merck 
Adjuvant 65 (Merck and Company, Inc., Rahway, N.J.) or Freund's Incomplete Adjuvant 
and Complete Adjuvant (Difco Laboratories, Detroit, Michigan). 

Typically, adjuvants such as Amphigen (oil-in-water), Alhydrogel (aluminum 
hydroxide), or a mixture of Amphigen and Alhydrogel are used. Only aluminxmi 
hydroxide is approved for human use. 

The proportion of immunogen and adjuvant can be varied over a broad range so 
long as both are present in effective amounts. For example, aluminum hydroxide can be 
present in an amount of about 0.5% of the vaccine mixture (AI2O3 basis). Conveniently, 
the vaccines are formulated to contain a final concentration of immunogen in the range of 
from 0.2 to 200 jig/ml, preferably 5 to 50 |ag/ml, most preferably 15 fig/ml. 

After formulation, the vaccine may be incorporated mto a sterile container which is 
then sealed and stored at a low temperature, for example 4°C, or it may be freeze-dried. 
Lyophilisation permits long-temi storage in a stabilised form. 

The vaccines are conventionally administered parenterally, by injection, for 
example, either subcutaneously or intramuscularly. Additional formulations which are 
suitable for other modes of administration include suppositories and, in some cases, oral 
formulations. For suppositories, traditional binders and carriers may include, for example, 
polyalkylene glycols or triglycerides; such suppositories may be formed from mixtures 



wo 02/088183 PCT/GB02/01923 

77 

containing the active ingredient in the range of 0.5% to 10%, preferably 1% to 2%. Oral 
formulations include such normally employed excipients as, for example, pharmaceutical 
grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, 
magnesium carbonate, and the like. These compositions take the form of solutions, 
5 suspensions, tablets, pills, capsules, sustained release formulations or powders and contain 
10% to 95% of active ingredient, preferably 25% to 70%. Where the vaccine composition 
is lyophilised, the lyophilised material may be reconstituted prior to administration, e.g. as 
a suspension. Reconstitution is preferably effected in buffer 

Capsules, tablets and pills for oral administration to a patient may be provided with 
1 0 an enteric coating comprising, for example, Eudragit "S", Eudragit "L*', cellulose acetate, 
cellulose acetate phthalate or hydroxypropylmethyl cellulose. 

The polypeptides of the invention may be formulated into the vaccine as neutral or 
salt forms. Pharmaceutically acceptable salts include the acid addition salts (formed with 
free amino groups of the peptide) and which are formed with inorganic acids such as, for 
15 example, hydrochloric or phosphoric acids, or such organic acids such as acetic, oxalic, 
tartaric and maleic. Salts formed with the free carboxyl groups may also be derived from 
inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric 
hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino 
ethanol, histidine and procaine. 

20 Administration 

Typically, a physician will determine the actual dosage which will be most suitable 
for an individual subject and it will vary with the age, weight and response of the 
particular patient. The dosages below are exemplary of the average case. There can, of 
course, be individual instances where higher or lower dosage ranges are merited. 

25 The pharmaceutical and vaccine compositions of the present invention may be 

administered by direct injection. The composition may be formulated for parenteral, 
mucosal, intramuscular, intravenous, subcutaneous, intraocular or transdermal 
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administration. Typically, each protein may be administered at a dose of from 0.01 to 30 
mg/kg body weight, preferably from 0.1 to 10 mg/kg, more preferably from 0.1 to 1 mg/kg 
body weight. 

The term "administered" includes delivery by viral or non-viral techniques. Viral 
5 delivery mechanisms include but are not limited to adenoviral vectors, adeno-associated viral 
(AAV) vectos, herpes viral vectors, retroviral vectors, lentiviral vectors, and baculoviral 
vectors. Non-viral delivery mechanisms include lipid mediated transfection, liposomes, 
immunoliposomes, lipofectin, cationic facial amphiphiles (CPAs) and combinations thereof. 
The routes for such delivery mechanisms include but are not limited to mucosal, nasal, 
10 oral, parenteral, gastrointestinal, topical, or sublingual routes. 

The term "administered" includes but is not limited to delivery by a mucosal route, 
for example, as a nasal spray or aerosol for inhalation or as an ingestable solution; a 
parenteral route where delivery is by an injectable form, such as, for example, an 
intravenous, intramuscular or subcutaneous route. 

15 The term "co-administered" means that the site and time of administration of each 

of for example, the polypeptide of the present invention and an additional entity such as 
adjuvant are such that the necessary modulation of the immune system is achieved. Thus, 
whilst the polypeptide and the adjuvant may be administered at the same moment in time 
and at the same site, there may be advantages in administering the polypeptide at a 

20 different time and to a different site from the adjuvant. The polypeptide and adjuvant may 
even be delivered in the same delivery vehicle - and the polypeptide and the antigen may 
be coupled and/or uncoupled and/or genetically coupled and/or uncoupled. 

The polypeptide, polynucleotide, peptide, nucleotide, antibody of the invention and 
optionally an adjuvant may be administered separately or co-administered to the host 
25 subject as a single dose or in multiple doses. 



The vaccine composition and pharmaceutical compositions of the present invention 
may be administered by a number of different routes such as injection (which includes 
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parenteral, subcutaneoxis and intramuscular injection) intranasal, mucosal, oral, intra- 
vaginal, urethral or ocular administration. 

The vaccines and pharmaceutical compositions of the present invention may be 
conventionally administered parenterally, by injection, for example, either subcutaneously 
5 or intramuscularly. Additional formulations which are suitable for other modes of 
administration include suppositories and, in some cases, oral fonnulations. For 
suppositories, traditional binders and carriers may include, for example, polyalkylene 
glycols or triglycerides; such suppositories may be formed from mixtures containing the 
active ingredient in the range of 0.5% to 10%, may be 1% to 2%. Oral formulations 

10 include such normally employed excipients as, for example, pharmaceutical grades of 
mannitol, lactose, starch, magnesium steamte, sodium saccharine, cellulose, magnesium 
carbonate, and the like. These compositions take the form of solutions, sxispensions, 
tablets, pills, capsules, sustained release formulations or powders and contain 10% to 95% 
of active ingredient, preferably 25% to 70%. Where the vaccine composition is 

1 5 lyophilised, the lyophilised material may be reconstituted prior to administration, e.g. as a 
suspension. Reconstitution is preferably effected in buffer. 

Examples 

Example L Transgenic TARZAN Knock-Out Mouse 
Construction of Tarzan Gene Targeting Vector 

20 Murine Tarzan genomic clones are isolated from a mouse large insert PAC library 

obtained from HGMP (Hinxton, UK) using a probe sequence amplified from the predicted 
murine open reading frame pDNA sequence (SEQ ID NO: 4), using standard techniques. 
The primers musTarzan Fl and Rl are used to generate the probe. 

The isolated murine Tarzan genomic clones are then restriction mapped in the 
25 region of the Tarzan gene using small oligonucleotide probes and standard techniques. The 
murine genomic locus is partially sequenced to enable the design of homologous arms to 
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clone into the targeting vector. The structure of the targeting vector, including its 
restriction sites, is shown in Figure 5. 

The murine Tarzan gene has a single coding exon. The targeting strategy is 
designed to remove the majority of this exon. A 1 .1 kb 5* homologous arm and a 3.7 kb 3' 
5 homologous arm flanking the coding exon are amplified by PGR and the fragments are 
cloned into the targeting vector. The 5' end of each oligonucleotide primer used to amplify 
the arms is synthesized to contain a different recognition site for a rare-cutting restriction 
enzyme, compatible with the cloning sites of the vector polylinkers and absent from the 
arms themselves. In the case of Tarzan, the primers are designed as listed in the sequence 
10 table below, with 5' arm cloning enzymes of Agel/NotI and 3' arm cloning enzymes of 
AscI/FseL 



In addition to the arm primer pairs (5'amiF/5*armR and 3*armF/3*armR), fiirther 
primers specific to the Tarzan locus are designed for the foUov^ng purposes: 5* and 3* 
probe primer pairs (5'prF/5'prR and 3'prF/3'prR) to amplify two short 200-300bp 

15 fragments of non-repetitive genomic DNA external to and extending beyond each arm, to 
allow Southern analysis of the targeted locus, in isolated putative targeted clones; a mouse 
genotyping primer pair (hetF and hetR) which allows differentiation between wild-type, 
heterozygote and homozygous mice, when used in a multiplex PGR with a vector specific 
primer, in this case, Asc306; and lastly, a target screening primer (5*scr.2) which anneals 

20 upstream of the end of the 5' arm region, and which produces a target event specific 1 .2kb 
amplimer when paired with a primer specific to the 5* end of the vector (DRl). This 
amplimer can only be derived from template DNA from cells where the desired genomic 
alteration has occurred and allows the identification of correctly targeted cells from the 
backgroimd of clones containing randomly mtegrated copies of the vector. The location of 

25 these primers and the genomic structure of the TARZAN locus used in the targeting 
strategy is shown in SEQ ID NO: 10. 

musTanan 5*prF TTTGATGGAGTAGATTCAGTGTCAGAC 
nwsTarzan 5'prR GTCCCAGAAGAGCTTGGGAATCCAAAC 
musTarzan 5*arinF.2 Age tttaccgGTTCCTACATTAGCGACTTTTCTGGAC 
musTarzan 5'armR Not aaagcggccgCCAGGAAATCCGAATCACTGGCTGGAC 
musTanan 3*arniF Asc aaaggcgcgccGATGCAAAGCCAGTGGGGACCTTGAAC 
musTarzan 3*armR Fsa tttggccGGcCTGAGCATGCATGGCAGTTTCCTTG 
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musTarzan S'scr^ 
musTarzan 3'prF 
musTarzan 3'prR 
musTarzan hetF 
musTarzan hetR 
musTarzan F1 
musTarzan R1 
Asc306 
DR1 



AATCATCCTTCTTGCTGGGGTTCAGAC 

CGTGGCACTCAAGTCTGGATGCACCTG 

AGACACTAGGACATTAATCCCTGTGAC 

TCCACATCTTGAG6GTCATTCGGATCG 

TTCTTTCCTTGTTCAAGGTCCCCACTG 

GGATTTCCTGGATTACCCAAGTGC 

CAGATACAGCAAGTCCGTCAAGGC 

AATGGCCGCTTTTCTGGATTCATCGAC 

CATGCCGCCTGCGCCCTAHGATCATG 



Table h Tanan Primer Sequences 

The position of the homology arms is chosen to functionally disrupt the Tarzan 
gene by deleting the seven transmembrane spanning regions. A targeting vector is 
prepared where the deleted Tarzan sequence is replaced with non-homologous sequences 
5 composed of an endogenous gene expression reporter (a frame independent lacZ gene) 
upstream of a selection cassette composed of a promoted neomycin phosphotransferase 
(neo) gene arranged in the same orientation as the Tarzan gene. 



Once the 5' and 3' homology arms had been cloned into the targeting vector 
pTKSIBLMNL (see Figure 5), a large highly pure DNA preparation is made using 
1 0 standard molecular biology techniques. 20 \ig of the freshly prepared endotoxin free DNA 
is restricted with another rare-cutting restriction enzyme Pmel, present at a unique site in 
the vector backbone between the amplicillin resistance gene and the bacterial origin of 
replication. The linearized DNA is then precipitated and resuspended in 100 jil of 
Phosphate Buffered Saline, ready for electroporation. 



15 24 hours following electroporation the transfected cells are cultured for 9 days in 

medium containing 200 ^ig/ml neomycin. Clones are picked into 96 well plates, replicated 
and expanded before being screened by PGR (using primers 5'scr.2 and DRl, as described 
above) to identify clones in which homologous recombination had occurred between the 
endogenous Tarzan gene and the targeting construct. Positive clones can be identified at a 

20 rate of 1 to 5%. These clones are expanded to allow replicas to be frozen and sufficient 
high quality DNA to be prepared for Southern blot confirmation of the targeting event 
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using the external 5' and 3' probes prepared as described above, all using standard 
procedures (Russ et al, 2000, Nature 2000 Mar 2; 404(6773):95-9). 

The structure of the genomic locus of mouse Tarzan before knock-put is depicted 
in Figure 3. The structure of the genomic locus of mouse Tarzan after knock-out is 
5 depicted in Figure 4. 

Generation of Tarzan GPCR Deficient Mice 

CS7BL/6 female and male mice are mated and blastocysts are isolated at 3.5 days 
of gestation. 10-12 cells from a chosen clone are injected per blastocyst and 7-8 
blastocysts are implanted in the uterus of a pseudopregnant Fl female. A litter of chimeric 
10 pups are bom containing several high level (up to 100%) agouti males (the agouti coat 
colour indicates the contribution of cells descendent from the targeted clone). These male 
chimeras are mated with female and MFl and 129 mice, and germline transmission is 
determined by the agouti coat colour and by PGR genotyping respectively. 

Genotyping is carried out by PGR on lysed tail clips, using the primers hetF and 
1 5 hetR with a third, vector specific primer (Asc306). This multiplex PGR allows 

amplification from the wild-type locus (if present) from primers hetF and hetR giving a 
220 bp band. The site for hetF is deleted in the knockout mice, so this amplification will 
fail from a targeted allele. However, the Asc306 primer will amplify a 343 bp band from 
the targeted locus, in combination with the hetR primer which anneals to a region just 
20 inside the 3* arm. Therefore, this multiplex PGR reveals the genotype of the litters as 

follows: wild-type samples will exhibit a single 220 bp band; heterozygous DNA samples 
yield two bands at 220 bp and 343 bp; and the homozygous samples will show only the 
target specific 343 bp band. 

LacZ Staining 



25 



The X gal staining of dissected tissues is performed in the following maimer. 



wo 02/088183 



PCT/GB02/01923 



83 

Representative tissue slices are made of large organs. Whole small organs and 
tubes are sliced open, so fixative and stain will penetrate. Tissues are rinsed thoroughly in 
PBS (phosphate buffered saline) to remove blood or gut contents. Tissues are placed in 
fixative (PBS containing 2% formaldehyde, 0.2% glutaraldehyde, 0.02% NP40, ImM 
5 MgC12, Sodium deoxycholate 0.23mM) for 30-45 minutes. Following three 5 minute 
washes in PBS, tissues are placed in Xgal staining solution (4mM K Ferrocyanide, 
4mMKFerricyanide, 2mM MgC12, Img/mlX-gal in PBS) for 18 hours at 30C. Tissues are 
PBS washed 3 times, postfixed for 24 hours in 4% formaldehyde, PBS washed again 
before storage in 70% ethanol. 



10 To identify Xgal stained tissues, dehydrated tissues are wax embedded, and 7um 

section sections cut, counterstained with 0.01% Safiranin (9-10 min). 

Behavioural and Neurological Testing 

Mice are housed under a 12h lights-12h dark light schedule (lights-on at 6 am) 
with free access to food and water. Mice (n=12), of mixed sexes, aged 3 to 4 months old, 
15 are submitted to behavioural testing during the morning, between lOh and 13h to avoid 
any differential circadian effect on the test results. 

Example 2. Expression of Recombinant TARZAN Protein 

Recombinant TARZAN is expressed and purified. Two systems are used for 
expression. 

20 pTOPO'Echo Donor Based Construct 

A polynucleotide having the sequence shown in SEQ ID NO: 2 is obtained from 
the human TARZAN nucleic acid sequence (SEQ ID NO: 1). The SEQ ID NO: 2 
polynucleotide is cloned into a pTOPO-Echo Donor vector module (Invitrogen 
pUniV5/His Cat# ETOOl-10). Transfection of the resulting construct into a host strain and 
25 induction of expression (according to the manufacturer's instructions) yields a fusion 
protein having the sequence of SEQ ID NO: 6. 
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The fiision polypeptide SEQ ID NO: 6 contains a C terminal V5 tag (residues 344 
to 357) and His tag (residues 358 to 363) to aid detection and purification. 

pCDNAS-JE Based Construct 

A polynucleotide having the sequence shown in SEQ ID NO: 7 is amplified by 
5 PGR using the oligonucleotide primers 

AATAAAGCTAGCACC ATGAATGAGCCACTAGAC and 
ATAATCTCGAGTCAAGGGTTGTTTGAGTAACTAATTTTC to incorporate new 
restriction sites, Nhel and Xhol at the 5-prime and 3 -prime ends respectively of Tarzan. 
This is then digested and ligated into similarly digested pcDNA5-JE (Invitrogen Cat# - 
10 K6010-01 vector modified to remove BGH Poly-A). 

The resulting construct is used for high level expression in CHO-Kl cells , and 
other mammalian cell lines, under the control of the cmv promoter to yield a native 
polypeptide. 

A polynucleotide having the sequence shown in SEQ ID NO: 2 is amplified by 
1 5 PGR using the oligonucleotide primers 

AATAAAGGTAGGAGCATGAATGAGGCACTAGAG and 

ATAATCTGGAGTGAGTTATCGTCGTCATGGTTGTAATCAGGGTTGTTTGAGTAA 
GTAATTTTC to incorporate new restriction sites, Nhel and Xhol at the 5-prime and 3- 
prime ends respectively of Tarzan and to include a 3' fiision FLAG tag. This is then 
20 digested and ligated into similarly digested pcDNA5-JE (Invitrogen Gat# - K6010-01 
vector modified to remove iBGH Poly-A). 

The resulting construct is used for high level expression in GHO-Kl cells, and 
other mammalian cell lines, under the control of the cmv promoter to yield a fiision 
polypeptide with C terminal FLAG tag (double underline) to aid detection and 
25 pxirification. The resultant expressed fusion polypeptide has a sequence shown in SEQ ID 
NO: 8. 

Introduction of Construct into Cells 
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The expression vector is introduced to the cells by lipofection (using Fugene-6 
from Roche, Cat# 1 814 433) among other similar methods. 

Both transient and stable transfection of these cells is achieved. In transient 
expression the cells are transfected by lipofection using a large amount of vector which 
5 results in a short-lived fast expression of the protein. In a stable transfection, the vector, 
which includes a selectable marker for neomycin resistance becomes stably integrated into 
the genome of the host cell resulting in a long lived cell line with a high expression level 
of Tarzan. 



Cells expressing recombinant TAR21AN are used for assay development, antibody 
10 production, and other purposes as described. 

Expression in Other Host Cells 

The recombinant/fusion clone SEQ ID NO: 2 is recombined into a pB AD-Thio-E 
fector (Invitrogen Cat# ETl 00-01) for high level bacterial expression under control of the 
ard&AD promoter, using a Cre/Lox mediated recombination system. 

1 5 The lecombinant/fusion clone SEQ ID NO: 2 is recombined into a pBlueBac 4.5E 

(Invitrogen Cat# ET3 10-01), using a Cre/Lox mediated recombination system, for 
subsequent recombination into Baculovirus expression systems. Recombination into 
MaxBac (Invitrogen Cat# K875-02) for high level expression in SF9 and other insect cell 
lines. 



20 The recombinant/fiision clone SEQ ID NO: 2 is recombined into pcDNA 3.1-E 

(Invitrogen Cat# ET400-01) , using a Cre/Lox mediated recombination system, for high 
level expression in CHO-Kl (Chinese Hamster Ovary) cells, and other mammalian cell 
lines, under the control of the cmv promoter. 
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Example 3. Expression of TARZAN in the Thyroid and Brain 

LacZ staining in Thyroid gland and the brain of the mice can be detected. The 
thyroid is an essential organ in controlling energy metabolism and this expression supports 
a role for this target in obesity related disorders. 

Example 4. Analysis of Blood Samples from TARZAN Knock-Out Mice 

Analysis of blood samples taken from mutant TARZAN animals can show an 
altered level of circulating metabolites and fatty acids. This coupled with the expression 
seen in the thyroid, may imply a use for the TARZAN protein in the development of 
therapeutics to treat hypercholesterolaemia, dislipdaemias, obesity or drugs to afTect 
energy regulation. 

Example 5. Motor Control and Balance in TARZAN Knock-Out Mouse 

TARZAN mutant mice can be used to test the role of the protein in motor co- 
ordination and balance by their performance on the Rotarod apparatus. 

Motor co-ordination and balance are measured by performance on the rotarod. We 
use an accelerating Rotarod (Ugo Basile, Linton instruments, Jones BJ, Roberts DJ. The 
quantiative measurement of motor inco-ordination in naive mice using an acelerating 
rotarod, JPharm Pharmacol 1968 Apr;20(4):302-4. We also use the Rotarod to assess 
motor learning by repeating the task over several days. 

Mice are placed on the Rotarod, which accelerates at a constant rate from 4 to 40 
rpm in S in. As the Rotarod reaches higher speed, the mice often grip the Rotarod and 
hang on for a fiill rotation (i.e. passive rotation). The time at which the mouse makes one 
full rotation is recorded. Mice are left on the Rotarod after the first passive rotation and 
allowed to perform the rest of the test until they drop from the rod. Mice are given 3 trials 
on three consecutive days. 
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Example 6. Visual Ability, Anxiety and Mobility in TARZAN Knock-Out Mouse 

Tests for visual ability, anxiety and mobility in a TARZAN knock-out mouse are 
conducted using a visual cliff test. 

The Visual cliff is developed by Fox (Fox M.WAmm Behav. 1965 Apr- 
5 Jul; 1 3(2):232-3) and provides a measure of gross visual ability. It evaluates the ability of 
the mouse to see the drop-off at the edge of a horizontal surface. Time for the animal to 
move one placed on the cliff (latency) is a measure of anxiety. 

A Perspex box is built with a horizontal plane connected to a vertical drop of 0.5m. 
A black and white chequerboard pattern accentuates the vertical drop-out. A sheet of clear 
10 Perspex is placed across the top horizontal cliff, extending over the top, thus there is the 
visual appearance of a cliff," but in fact the Perspex provides a solid horizontal surface. 

Each mouse is given 10 trials on the visual cliff. A trial consists of placing the 
mouse on the centre 'ridge' and noting the time taken from the animal to move off the 
'ridge' (latency) and recording the side on which the mouse stepped. The result is 
15 considered positive when the animal chooses to walk on the chequered "safe" side and 
avoids the cliff and negative result for the other way round. After 5 trials the box is turned 
through 180 degrees and a fiirther 5 trials given. This is done to eliminate the variable of 
the position of the observer. The platform is cleaned between each animal. 

Example 7. Tests for Sensitivity to External Stimuli and Pain (Analgesia Testing) in 
20 TARZAN Knock-Out Mouse: Paw Pressure Test 

TARZAN mutants can be tested for their sensitivity to touch and pain stimuli using 
a range of tests. The tests in this and the following four Examples assess both neuropathic 
and inflammatory pain. 

Skin sensitivity is assessed by applying pressure on the hindpaw with a sharpened 
25 dowel rod whilst the animal is resting on a grid. Responses are graded as follow: 0 = no 
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withdrawal; 1 =slow withdrawal of the paw; 2 = medium withdrawal of the paw; 3 = fast 
withdrawal of the paw. 

Example 8. Tests for Sensitivity to External Stimuli and Pain (Analgesia Testing) in 
TARZAN Knock-Out Mouse: Tail Flick Test 

5 A tail flick analgesia test is performed using a Tail-Flick Analgesia Meter. This 

equipment provides an easy to use method to determine pain sensitivity accurately and 
reproducibly in rodents (D'Amour and Smith, 1941 Journal of Pharmacology and 
Experimental Therapeutics. Vol 41 p4 19-424), The instrument has a shutter-controlled 
lamp as a heat source. The lamp is located below the animal to provide a less confining 
10 environment. Tail flick is detected by the automatic detection circuitry, which leaves the 
user's hands free to handle the animal. The animal is restrained in a ventilated tube and its 
tail placed on a sensing groove on top of the equipment. 

Activation of an intense light beam to the tail through opening of the shutter results 
in discomfort at some point when the animal will flick its tail out of the beam. In the 
1 5 automatic mode a photo-detector detects the tail motion causing the clock to stop and the 
shutter to close. The total time elapsed between the shutter opening and the animal's 
reaction is recorded. 

Responses of mutant transgenic mice are compared with age and sex matched 
wild-type mice. A single animal may be subjected to different heat settings to produce an 
20 increase in tail temperature no greater than 55^C. 

Using the tail flick test TARZAN mutants can be tested for their sensitivity to heat 
induced pain and compated to their wild-type counterparts. Decreased sensitivity in 
mutants is associated by the withdrawal of their tails after a longer time period of exposure 
to the heat source. 



wo 02/088183 



PCT/GB02/01923 



89 

Example 9. Tests for Sensitivity to External Stimuli and Pain (Analgesia Testing) in 
TARZAN.Knock-Out Mouse: Formalin Test 

The formalin test measures the response to a noxious substances injected into a 
hind paw. A volume of 20p.l of a 1% formalin solution is injected through a fine gauge 
5 needle subcutaneously into the dorsal surface of one hindpaw. Licking and biting the 
hindpaw is quantitated as cumulative number of seconds engaged in the behaviours. A 
rating scale is used: 1= the formalin injected paw rests lightly on the floor bearing less 
weight; 2= the injected paw is elevated; 3= the injected paw is licked, bitten or shaken. 



Two phases of responses are seen in the formalin test. Phase 1 begins immediately 
10 after injection and lasts about 10 mins, representmg the acute burst of activity from pain 
fibres. Phase two begins about 20 mins after injection and continues for about one hour. 
This phase appears to represent responses to tissue damage, including inflammatory 
hyperalgesia. 

Using the formalin test TARZAN mutants may be shown to be less sensitive to 
1 5 inflammatory pain and may show a reduced severity of response in the formalin test when 
compared to wild type animals. 



Example 10. Tests for the Role of TARZAN in Energy Metabolism 

TARZAN mutant mice may be used to test the function of the TARZAN protein in 
energy metabolism by the determination of total body oxygen consiunption (V02) (a 

20 parameter of energy expenditure) through indirect calorimetry. Oxygen is required for 
combustion of fuels. By measuring total body oxygen consumption the total amount of 
fuel used can be calculated. Energy expenditure is measured by estimating oxidation rates 
of macronutrients from rates of respiratory exchange of oxygen and carbon dioxide 
(Vidal-Puig AJ, Grujic C, Zhang CY, Hagen T, Boss 0, Ido Y, Szczepanik A, Wade J, 

25 Mootha V, Cortright R: Energy Metabolism m Uncoupling Protein 3 Gene Knockout Mice 
. J Biol Chem 275:16258-16266, 2000). 
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These measurements provide information not only about the total energy 
expenditure but also about preferential use of specific nutrients as a source of energy. 
Energy expenditure is measured in basal conditions and in response to some established 
modulators (e.g. fasting would produce a decrease in energy expenditure), specific diet 
5 manipulation (e.g. High fat diet should increase energy expenditure), or pharmacological 
treatment (e.g. adrenergic receptor agonists should induce energy expenditure (8)). This 
information determines whether energy expenditure is altered in TARZAN mutants and 
provides insights on the mechanisms involved (e.g. oxidation of fatty acids vs 
carbohydrates etc). These measurements are performed in calorimeters with individual 
10 chambers. 

Thus oxygen consumption and respiratory exchange ratio are measured (AA) in 
basal, fasted (24/36h), and other experimental conditions (e.g. diets, pharmacological 
treatments) using a calorimeter (e.g. Columbus Instruments, Columbus, OH). Standard 
conditions of the measurements are settling time 100 sec, measuring time 50 sec, with 
room air as reference. However, these parameters are optimised specifically for every 
experiment. Air in the respiratory chamber is continuously exchanged. This chamber has 
two sensors that measure the difference between the oxygen and CO2 that enters and 
leaves the chamber. Experiments may require up to 1-hour calibration time, period during 
which animals can be placed in the chamber. Thus a single animal is deposited in an 
individual chamber and left undisturbed during the time required for the experiment. The 
length of the experiment would be variable (2-9 h) depending on its specific nature (e.g. 
fed or fasted animals will be in the chamber during 2h while Lipopolisacaride injected 
animals will require 6-8 hours). 

Differences in reading between TARZAN mutant and wild type mice indicate a 
25 role for the TARZAN protein in metabolism and indicate the use of this target for the 
development of therapeutics tackling disorders of hypercholesterolaemia, dislipdaemias 
and obesity. 

Each of the applications and patents mentioned in this document, and each 
document cited or referenced in each of the above applications and patents, including 



15 



20 
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during the prosecution of each of the applications and patents ("application cited 
documents") and any manufacturer's instructions or catalogues for any products cited or 
mentioned in each of the applications and patents and in any of the application cited 
documents, are hereby incorporated herein by reference. Furtfiermore, all documents cited 
5 in this text, and all documents cited or referenced in documents cited in this text, and any 
manufacturer's instructions or catalogues for any products cited or mentioned in this text, 
are hereby incorporated herein by reference. 

Various modifications and variations of the described methods and system of the 
invention will be apparent to those skilled in the art without departing from the scope and 
1 0 spirit of the invention. Although the invention has been described in connection with 
specific preferred embodiments, it should be understood that the invention as claimed 
should not be unduly limited to such specific embodiments. Indeed, various modifications 
of the described modes for carrying out the invention which are obvious to those skilled in 
molecular biology or related fields are intended to be within the scope of the claims. 
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Sequences 

SEQ ID NO: 1 Human TARZAN cDNA 
hucDNA 

5 

TTTAACTTTTATATTTTTATTAGATGCATTTAGTAACTTGCCTCATAGTCATTTTCTTGGAAATTC 
AATTTCTTCTCCACAGGGTCTCTTTTGAGATTAAAGAGAGAGAAGTGGCAAATTTAGGATGTTAGA 
ATAATTTTCATTT/WU^GTAGATCCTTGTTTTTATTACCCTATCATTAATGTTTTCTGTTTTCCTT 
TATCAGCGAGTTACTGCTCATTTGATTCATATTGCCAAACTGAACTCTCTTGTTTTCTTGCAAGAT 

10 GAAAGGAGACAACCATGAATGAGCCACTAGACTATTTAGCAAATGCTTCTGATTTCCCCGATTATG 
CAGCTGCTTTTGGAAATTGCACTGATGAAAACATCCCACTCAAGATGCACTACCTCCCTGTTATTT 
ATGGCATTATCTTCCTCGTGGGATTTCCAGGCAATGCAGTAGTGATATCCACTTACATTTTCAAAA 
TGAGACCTTGGAAGAGCAGCACCATCATTATGCTGAACCTGGCCTGCACAGATCTGCTGTATCTGA 
CCAGCCTCCCCTTCCTGATTCACTACTATGCCAGTGGCGAAAACTGGATCTTTGGAGATTTCATGT 

15 GTAAGTTTATCCGCTTCAGCTTCCATTTCAACCTGTATAGCAGCATCCTCTTCCTCACCTGTTTCA 
GCATCTTCCGCTACTGTGTGATCATTCACCCAATGAGCTGCTTTTCCATTCACAAAACTCGATGTG 
CAGTTGTAGCCTGTGCTGTGGTGTGGATCATTTCACTGGTAGCTGTCATTCCGATGACCTTCTTGA 
TCACATCAACCAACAGGACCAACAGATCAGCCTGTCTCGACCTCACCAGTTCGGATGAACTCAATA 
CTATTAAGTGGTACAACCTGATTTTGACTGCAACTACTTTCTGCCTCCCCTTGGTGATAGTGACAC 

20 TTTGCTATACCACGATTATCCACACTCTGACCCATGGACTGCAAACTGACAGCTGCCTTAAGCAGA 
AAGCACGAAGGCTAACCATTCTGCTACTCCTTGCATTTTACGTATGTTTTTTACCCTTCCATATCT 
TGAGGGTCATTCGGATCGAATCTCGCCTGCTTTCAATCAGTTGTTCCATTGAGAATCAGATCCATG 
AAGCTTACATCGTTTCTAGACCATTAGCTGCTCTGAACACCTTTGGTAACCTGTTACTATATGTGG 
TGGTCAGCGACAACTTTCAGCAGGCTGTCTGCTCAACAGTGAGATGCAAAGTAAGCGGGAACCTTG 

25 AGCAAGCAAAGAAAATTAGTTACTCAAACAACCCTTGAAATATTTCATTTACTTAACCAAAAACAA 
ATACTTGCTGATACTTTACCTAGCATCCTAAGATGTTCAGGATGTCTCCCTCAATGGAACTCCTGG 
TAAAtACTGTGTATTCAAGTAATCATGTGCCAAAGCCAGGGCAGAGCTTCTAGTTCTTTGCAATCC 
CTTTATTGAGCTCCTCCACTGGGGAGATATAAGAATGGGATGCATGTATATCAGCAAAGTATTCAG 
ACATAGTATTACAAGCTATTGGAACTCAGAGGCATCTTAGAGAACATCTGTTCCCACCAACTTACT 

30 ATATATACACGGAAACCAATTTCTTACCCTTGCCCTAGATTGCTCAGTAAATTTGTGCCAAGATAG 
GAGAAAACCAATCTTTTCACTCATCATTTCATGCTTCTCTGCACTCTGGGCCTATTTGTATTGAAC 
CATTAGACAATTCAAACCACTACTTGTATCTTTCTTAATATTTATTTTTTACATCTCAGAGCTCTA 
CAATTTGTTTCCTTC 



35 <210> 1 

<211> 1179 

<212> DNA 

<213> Homo sapiens 



40 <400> 1 
ggagacaacc 
tgcagctgct 
tgttatttat 
ttacattttc 

45 cacagatctg 
aaactggatc 
gtatagcagc 
cccaatgagc 
gtggatcatt 

50 gaccaacaga 
gtacaacctg 
ctataccacg 



atgaatgagc 
tttggaaatt 
ggcattatct 
aaaatgagac 
ctgtatctga 
tttggagatt 
atcctcttcc 
tgcttttcca 
tcactggtag 
tcagcctgtc 
attttgactg 
attatccaca 



cactagacta 
gcactgatga 
tcctcgtggg 
cttggaagag 
ccagcctccc 
tcatgtgtaa 
tcacctgttt 
ttcacaaaac 
ctgtcattcc 
tcgacctcac 
caactacttt 
ctctgaccca 



tttagcaaat 
aaacatccca 
atttccaggc 
cagcaccatc 
cttcctgatt 
gtttatccgc 
cagcatcttc 
tcgatgtgca 
gatgaccttc 
cagttcggat 
ctgcctcccc 
tggactgcaa 



gcttctgatt 
ctcaagatgc 
aatgcagtag 
attatgctga 
cactactatg 
ttcagcttcc 
cgctactgtg 
gttgtagcct 
ttgatcacat 
gaactcaata 
ttggtgatag 
actgacagct 



tccccgatta 
actacctccc 
tgatatccac 
acctggcctg 
ccagtggcga 
atttcaacct 
tgatcattca 
gtgctgtggt 
caaccaacag 
ctattaagtg 
tgacactttg 
gccttaagca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 
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gaaagcacga aggctaacca ttctgctact ccttgcattt tacgtatgtt ttttaccctt 780 

ccatatcttg agggtcattc ggatcgaatc tcgcctgctt tcaatcagtt gttccattga 840 

gaatcagatc catgaagctt acatcgtttc tagaccatta gctgctctga acacctttgg 900 

taacctgtta ctatatgtgg tggtcagcga caactttcag caggctgtct gctcaacagt 960 

gagatgcaaa gtaagcggga accttgagca agcaaagaaa attagttact caaacaaccc 1020 

ttgaaatatt tcatttactt aaccaaaaac aaatacttgc tgatacttta cctagcatcc 1080 

taagatgttc aggatgtctc cctcaatgga actcctggta aatactgtgt attcaagtaa 1140 

tcatgtgcca aagccagggc agagcttcta gttctttgc 1179 



10 

SEQIDNO: 2 ORF from SEQ ID NO: 1 

ATGAATGAGCCACTAGACTATTTAGCAAATGCTTCTGATTTCCCCGATTATGCAGCTGCTTTTGGA 
AATTGCACTGATGAAAACATCCCACTCAAGATGCACTACCTCCCTGTTATTTATGGCATTATCTTC 
15 CTCGTGGGATTTCCAGGCAATGCAGTAGTGATATCCACTTACATTTTCAAAATGAGACCTTGGAAG 

AGCAGCACCATCATTATGCTGAACCTGGCCTGCACAGATCTGCTGTATCTGACCAGCCTCCCCTTC 
CTGATTCACTACTATGCCAGTGGCGAAAACTGGATCTTTGGAGATTTCATGTGW.GTTTATCCGC 
TTCAGCTTCCATTTCAJ^X'CTGTATAGCAGCATCCTCTTCCTCACCTGTTTCAGCATCTTCCGCTAC 
TGTGTGAT(:ATTCACCCAATGAGCTGCTTTTCCATTCACAAi2^J^CTCGATGTGCAGTTGTAGCCTGT 

20 GCTGTGGTGTGGATCATTTCACTGGTAGCTGTCATTCCGATGACCTTCTTGATCACATCAACCAAC 
AGGACCAACAGATCAGCCTGTCTCGACCTCACCAGTTCGGATGAJi.CTCAATACTATTAAGTGGTAC 
AACCTGATTTTGACTGCAACTACTTTCTGCCTCCCCTTGGTGATAGTGACACTTTGCTATACCACG 
ATTATCCACACTCTGACCCATGGACTGCAAACTGACAGCTGCCTTAJIGCAGAAJVGCACGAJ\GGCTA 
ACCATTCTGCTACTCCTTGCATTTTACGTATGTTTTTTACCCTTCCATA.TCTTGAGGGTCATTCGG 

25 ATCGAATCTCGCCTGCTTTCAJ\TCAGTTGTTCCATTGAGAATCAGATCCATGAJ\GCTTACATCGTT 
TCTAGACCATTAGCTGCTCTGAACACCTTTGGTAACCTGTTACTATATGTGGTGGTCAGCGACAAC 
TTTCAGCAGGCTGTCTGCTCAACAGTGAGATGCAAAGTAAGCGGGAACCTTGAGCAAGCAAAGAAA 
ATTAGTTACTCAAACAACCCTTGA 



30 <210> 2 

<211> 1014 

<212> DNA 

<213> Homo sapiens 



35 <400> 2 

atgaatgagc cactagacta tttagcaaat 
tttggaaatt gcactgatga aaacatccca 
ggcattatct tcctcgtggg atttccaggc 
aaaatgagac cttggaagag cagcaccatc 

40 ctgtatctga ccagcctccc cttcctgatt 
tttggagatt tcatgtgtaa gtttatccgc 
atcctcttcc tcacctgttt cagcatcttc 
tgcttttcca ttcacaaaac tcgatgtgca 
tcactggtag ctgtcattcc gatgaccttc 

45 tcagcctgtc tcgacctcac cagttcggat 
attttgactg caactacttt ctgcctcccc 
attatccaca ctctgaccca tggactgcaa 
aggctaacca ttctgctact ccttgcattt 
agggtcattc ggatcgaatc tcgcctgctt 

50 catgaagctt acatcgtttc tagaccatta 
ctatatgtgg tggtcagcga caactttcag 
gtaagcggga accttgagca agcaaagaaa 



gcttctgatt tccccgatta tgcagctgct 60 

ctcaagatgc actacctccc tgttatttat 120 

aatgcagtag tgatatccac ttacattttc 180 

attatgctga acctggcctg cacagatctg 24 0 

cactactatg ccagtggcga aaactggatc 300 

ttcagcttcc atttcaacct gtatagcagc 360 

cgctactgtg tgatcattca cccaatgagc 420 

gttgtagcct gtgctgtggt gtggatcatt 480 

ttgatcacat caaccaacag gaccaacaga 54 0 

gaactcaata ctattaagtg gtacaacctg 600 

ttggtgatag tgacactttg ctataccacg 660 

actgacagct gccttaagca gaaagcacga 720 

tacgtatgtt ttttaccctt ccatatcttg 780 

tcaatcagtt gttccattga gaatcagatc 840 

gctgctctga acacctttgg taacctgtta 900 

caggctgtct gctcaacagt gagatgcaaa 960 

attagttact caaacaaccc ttga 1014 



55 



SEQ ED NO: 3 peptide sequence translation of above 
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MNEPLDYLANASDFPDYAAAFGNCTDENIPLKMHYLPVIYGIIFLVGFPGNAVVISTYIFKMRPWK 
SSTIIMLNLACTDLLYLTSLPFLIHYYASGENWIFGDFMCKFIRFSFHFNLYSSILFLTCFSIFRY 
CVIIHPMSCFSIHKTRCAWACAVVWIISLVAVIPMTFLITSTNRTNRSACLDLTSSDELNTIKWY 
NLILTATTFCLPLVIVTLCYTTIIHTLTHGLQTDSCLKQKARRLTILLLLAFYVCFLPFHILRVIR 
5 lESRLLSISCSIENQIHEAYIVSRPLAALNTFGNLLLYVWSDNFQQAVCSTVRCKVSGNLEQAKK 

ISYSNNPZ 

<210> 3 

<211> 337 

10 <212> PRT. 

<213> Homo sapiens 

<400> 3 

^3et Asn Glu Pro Leu Asp Tyr Leu Ala Asn Ala Ser Asp Phe Pro Asp 
15 1 5 10 15 

Tyr Ala Ala Ala Phe Gly Asn Cys Thr Asp Glu Asn He Pro Leu Lys 
20 25 30 

20 Met His Tyr Leu Pro Val He Tyr Gly He He Phe Leu Val Gly Phe 
35 40 45 

Pro Gly Asn Ala Val Val He Ser Thr Tyr He Phe Lys Met Arg Pro 

50 55 60 . 

25 

Trp Lys Ser Ser Thr He He Met Leu Asn Leu Ala Cys Thr Asp Leu 
65. 70 75 80 

Leu Tyr Leu Thr Ser Leu Pro Phe Leu He His Tyr Tyr Ala Ser Gly 
30 85 90 95 

Glu Asn Trp He Phe Gly Asp Phe Met Cys Lys Phe He Arg Phe Ser 
100 105 110 

35 Phe His Phe Asn Leu Tyr Ser Ser He Leu Phe Leu Thr Cys Phe Ser 
115 120 125 

He Phe Arg Tyr Cys Val He He His Pro Met Ser Cys Phe Ser lie. 
130 135 140 

40 

His Lys Thr Arg Cys Ala Val Val Ala Cys Ala Val Val Trp He He 
145 150 155 160 

Ser Leu Val Ala Val He Pro Met Thr Phe Leu He Thr Ser Thr Asn 
45 165 170 175 

Arg Thr Asn Arg Ser Ala Cys Leu Asp Leu Thr Ser Ser Asp Glu Leu 
180 185 190 

50 Asn Thr He Lys Trp Tyr Asn Leu He Leu Thr Ala Thr Thr Phe Cys 
195 200 205 

Leu Pro Leu Val He Val Thr Leu Cys Tyr Thr Thr He He His Thr 
210 215 220 

55 

Leu Thr His Gly Leu Gin Thr Asp Ser Cys Leu Lys Gin Lys Ala Arg 
225 230 235 240 



wo 02/088183 



PCT/GB02/01923 



95 

Arq Leu Thr lie Leu Leu Leu Leu Ala Phe Tyr Val Cys Phe Leu Pro 
245 250 255 

Phe His lie Leu Arg Val He Arg He Glu Ser Arg Leu Leu Ser He 

5 260 265 270 

Ser Cys Ser He Glu Asn Gin He His Glu Ala Tyr He Val Ser Arg 

275 280 " 285 

10 Pro Leu Ala Ala Leu Asn Thr Phe Gly Asn Leu Leu Leu Tyr Val Val 

290 295 300 



15 



20 



25 



35 



Val Ser Asp Asn Phe Gin Gin Ala Val Cys Ser Thr Val Arg Cys Lys 
305 310 315 320 

Val Ser Gly Asn Leu Glu Gin Ala Lys Lys He Ser Tyr Ser Asn Asn 
325 330 335 

Pro 



SEQIDNO: 4 mouse Tarzan ORF 

ATGATTGAGCCACTGGACAGTCCAGCCAGTGATTCGGATTTCCTGGATTACCCAAGTGCTCTGGGA 
AACTGCACCGACGAGCAAATCTCATTCAAGATGCAGTACCTTCCCGTCATCTATAGCATCATCTTC 
CTCGTGGGCTTCCCGGGGAACACAGTGGCCATCTCCATCTACATTTTCAAGATGCGGCCGTGGAGG 
GGCAGTACAGTCATCATGCTGAACCTGGCCTTGACGGACTTGCTGTATCTGACCAGCCTCCCGTTC 
CXrATrCATTACTATGCCAGTGGTGAAAACTGGATCTTTGGAGATTTCATGTGCAAGTTCATCCGC 
TTCGGCTTCCACTTCAACCTCTACAGCAGCATTCTCTTCCTCACCTGCTTCAGTCTCTTCCGTTAC 
30 GTTGTGATrATTCACCCGATGAGCTGCTTTTCTATTCAGAAAACTCGCTGGGCAGTGGTAGCTTGT 
GC^GGGGTGTGGGTCATTTCTTTGGTAGCTGTCATGCCCATGACTTTCCTGATCACATCAACCACC 
CGGACCAATAGGTCTGCTTGCCTTGACCTCACCAGTTCAGATGACCTCACTACTATCAAGTGGTAC 
AATCTCATTTTGACAGCCACCACTTTCTGCCTGCCATTGGTGATAGTGACACTTTGCTACACGACA 
ATTATrAGTACCCTGACTCACGGGCCTCGGACCCACAGCTGCTTTAAGCAGAAGGCTCGGAGACTG 
ACTATTCTGCTCCTCCTTGTTTTCTATATATGTTTCTTACCCTTCCACATCTTGAGGGTCATTCGG 
ATCGAATCTCGCCTGCTTTCAATCAGCTGCTCCATCGAGAGTCACATCCACGAAGCTTACATTGTT 
TCTAGACCATTAGCTGCTCTCAACACCTTTGGCAACCTGCTGTTATATGTTGTGGTCAGCAATAAC. 
TTCCAGCAGGCATTCTGCTCTATAGTGAGATGCAAAGCCAGTGGGGACCTTGAACAAGGAAAGAAA 

GACAGTTGCTCAAACAACCCTTGA 

40 

<210> 4 
<211> 501 
<212> DNA 
45 <213> Mus mus cuius 

<400> 4 

atgattgagc cactggacag tccagccagt gattcggatt tcctggatta cccaagtgct 
ctgggaaact gcaccgacga gcaaatctca ttcaagatgc agtaccttcc cgtcatctat 

50 agcatcatct tcctcgtggg cttcccgggg aacacagtgg ccatctccat ctacattttc 
aagatgcggc cgtggagggg cagtacagtc atcatgctga acctggcctt gacggacttg 
ctgtatctga ccagcctccc gttcctcatc cattactatg ccagtggtga aaactggatc 
tttggagatt tcatgtgcaa gttcatccgc ttcggcttcc acttcaacct ctacagcagc 
attctcttcc tcacctgctt cagtctcttc cgttacgttg tgatcattca cccgatgagc 420 

55 tgcttttcta ttcagaaaac tcgctgggca gtggtagctt gtgccggggt gtgggtcatt 480 
tctttggtag ctgtcatgcc c 



60 

120 

180 

240 

300 

360 
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SEQ ID NO: 5 peptide sequence translation of above 

5 MIEPLDSPASDSDFLDYPSALGNCTDEQISFKMQYLPVIYSIIFLVGFPGNTVAISIYIFKMRPWR 
GSTVIMLNLALTDLLYLTSLPFLIHYYASGENWIFGDFMCKFIRFGFHFNLYSSILFLTCFSLFRY 
WIIHPMSCFSIQKTRWAWACAGVWVISLVAVMPMTFLITSTTRTNRSACLDLTSSDDLTTIKWY 
NLILTATTFCLPLVIVTLCYTTIISTLTHGPRTHSCFKQKARRLTILLLLVFYICFLPFHILRVIR 
lESRLLSISCSIESHIHEAYIVSRPLAALNTFGNLLLYWVSNNFQQAFCSIVRCKASGDLEQGKK 
10 DSCSNNPZ 

<210> 5 

<211> 167 

<212> PRT 

IS <213> Mus musculus 

<400> 5 

Met He Glu Pro Leu Asp Ser Pro Ala Ser Asp Ser Asp Phe Leu Asp 
15 10 15 

20 

Tyr Pro Ser Ala Leu Gly Asn Cys Thr Asp Glu Gin He Ser Phe Lys 
20 25 30 

Met Gin Tyr Leu Pro Val He Tyr Ser He He Phe Leu Val Gly Phe 
25 35 40 45 

Pro Gly Asn Thr Val Ala He Ser He Tyr He Phe Lys Met Arg Pro 
50 55 60 

30 Trp Arg Gly Ser Thr Val He Met Leu Asn Leu Ala Leu Thr Asp Leu 
65 70 75 80 

Leu Tyr Leu Thr Ser Leu Pro Phe Leu He His Tyr Tyr Ala Ser Gly 
85 90 95 

35 

Glu Asn Trp He Phe Gly Asp Phe Met Cys Lys Phe He Arg Phe Gly 
100 105 110 

Phe His Phe Asn Leu Tyr Ser Ser He Leu Phe Leu Thr Cys Phe Ser 
40 115 120 125 

Leu Phe Arg Tyr Val Val He He His Pro Met Ser Cys Phe Ser He . 
130 135 140 

45 Gin Lys Thr Arg Trp Ala Val Val Ala Cys Ala Gly Val Trp Val He 
145 150 155 160 

Ser Leu Val Ala Val Met Pro 
165 

50 

SEQ ID NO: 6 Fusion protein of TARZAN with V5 and His tags at c-teminal end. 

MNEPLDYLANASDFPDYAAAFGNCTDENIPLKMHYLPVIYGIIFLVGFPGNAVVISTYIFKMRPWK 
55 SSTIIMLNLACTDLLYLTSLPFLIHYYASGENWIFGDFMCKFIRFSFHFNLYSSILFLTCFSIFRY 
CVIIHPMSCFSIHKTRCAWACAWWIISLVAVIPMTFLITSTNRTNRSACLDLTSSDELNTIKWY 
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NLILTATTFCLPLVIVTLCYTTIIHTLTHGLQTDSCLKQKARRLTILLLLAFYVCFLPFHILRVIR 
lESRLLSISCSIENQIHEAYIVSRPLAALNTFGNLLLYVWSDNFQQAVCSTVRCKVSGNLEQAKK 
ISYSNNP RAIRELGKPIPNPLLGLDSSHHHHHH 



SEQIDNO:? 

GCTAGCACCATGAATGAGCCACTAGACTATTTAGCAAATGCTTCTGATTTCCCCGATTATGCAGCT 
GCTTTTGGAAATTGCACTGATGAAAACATCCCACTCAAGATGCACTACCTCCCTGTTATTTATGGC 
ATTATCTTCCTCGTGGGATTTCCAGGCAATGCAGTAGTGATATCCACTTACATTTTCAAAATGAGA 

10 CCTTGGAAGAGCAGCACCATCATTATGCTGAACCTGGCCTGCACAGATCTGCTGTATCTGACCAGC 
CTCCCCTTCCTGATTCACTACTATGCCAGTGGCGAAAACTGGATCTTTGGAGATTTCATGTGTAAG 
TTTATCCGCTTCAGCTTCCATTTCAACCTGTATAGCAGCATCCTCTTCCTCACCTGTTTCAGCATC 
TTCCGCTACTGTGTGATCATTCACCCAATGAGCTGCTTTTCCATTCACAAAACTCGATGTGCAGTT 
GTAGCCTGTGCTGTGGTGTGGATCATTTCACTGGTAGCTGTCATTCCGATGACCTTCTTGATCACA 

15 TCAACCAACAGGACCAACAGATCAGCCTGTCTCGACCTCACCAGTTCGGATGAACTCAATACTATT 
AAGTGGTACAACCTGATTTTGACTGCAACTACTTTCTGCCTCCCCTTGGTGATAGTGACACTTTGC 
TATACCACGATTATCCACACTCTGACCCATGGACTGC7VAACTGACAGCTGCCTTAAGCAGAAAGCA 
CGAAGGCTAACCATTCTGCTACTCCTTGCATTTTACGTATGTTTTTTACCCTTCCATATCTTGAGG 
GTCATTCGGATCGAATCTCGCCTGCTTTC7\ATCAGTTGTTCCATTGAGAATCAGATCCATGAAGCT 

20 TACATCGTTTCTAGACCATTAGCTGCTCTGAACACCTTTGGTAACCTGTTACTATATGTGGTGGTC 
AGCGACAACTTTCAGCAGGCTGTCTGCTCAACAGTGAGATGCAAAGTAAGCGGGAACCTTGAGCAA 
GCAAAGAAAATTAGTTACTCAAACAACCCTTGACTCGAG 

SEQroN0:8 

25 MNEPLDYLANASDFPDYAAAFGNCTDENIPLKMHYLPVIYGIIFLVGFPGNAWISTYIFKMRPWK 
SSTIIMLNLACTDLLYLTSLPFLIHYYASGENWIFGDFMCKFIRFSFHFNLYSSILFLTCFSIFRY 
CVIIHPMSCFSIHKTRCAWACAWWIISLVAVIPMTFLITSTNRTNRSACLDLTSSDELNTIKWY 
NLILTATTFCLPLVIVTLCYTTIIHTLTHGLQTDSCLKQKARRLTILLLLAFYVCFLPFHILRVIR 
lESRLLSISCSIENQIHEAYIVSRPLAALNTFGNLLLYVWSDNFQQAVCSTVRCKVSGNLEQAKK 

30 TSYSNNPDYKDDDDK 



SEQ ID NO: 10 genomic locus from 5'prF to 3'prR 



5 



35 



40 



45 





29900 

TCATCT 

3 0000 
iC^CCC T 

<S'prR 



30100 



50 




55 



60 



30400 



65 



70 
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30700 

- HMHNMHTOWMlTOHNWllWNNMMNin^^ ICCitJCiLLi O-'i CC I CCTC 

30800 

C ltC 'l XX 'll Xn t r ii:tl'CLIV C l'C€l X: ti LC I C rTCC4' CLi '' i 't 1 ICn^ ^ ^ Itl in ICntlTtl iH 1L1 I ' rA ATIkATAATGCACTXTGAA CJUUUI 

30900 

ACMmMOCSAOGtfUlTGGC7rCCATnG6CTR?r%TATCCTGAAT^^ 

TQTKh k T lVC lV L 'rX k iAJailOGTht^^ ItH lUAtI llgCIVITlX« AffirTCXau: C C S TCCrGGXCCTCC 
15 31000 

Z\) ACi\ K ;i LVCri'AaXaCATTACGGACTCrAAGCXnTrCJUU^TGAT^ 1 lUX ' 1 iUG TCATGGTAATCTCTC 

TGACGACaXSAATOXTCIAATGCCTCAGAriCOGAAACTTrACT A r^^ ^ ^ 

>5*axnP.2 

25 >S'acr.2 >S*ann 

i I 31200 

AT(XaU^CA A Tre C CT A CTCAGACATTTAATIAGGAAAAAATaiTC ClTt ' i HiClO GGgrTCACACAAA A HriTC C f A CATrACCGA Li 11 ALil^ ACTTC 
TACCITCrrAACCGATCACTCrCTAAATIA Al ' LLl i 1 1 1 iA CTAGCAAGAACGACCCCAA GAC.IUi i 1 lA CAAGCATCTM^TOOCrrCfcAAAGACCTGAAG 

31300 

SATCTCIUyrA g TCrT CA AC l C i 'l jtfl C rA CCAAA lTAri T A CAGCTT M ^^ 
ATCACJU^ClTOAGACCMaTCMTTrAATAAATCTCCA A TT C TT OT 

35 31400 

ACCAGCTtnTTTTCITCSTIGCITAAACCTGIGATAAACTTATATT^ 
TCGTCGACAAAAAC»UiCAAC{»ATTTGSACACTAT1TCAATATAAAA6ACATAm 

An 

4U CTATTCATCrT(rrGGAAATGCAAAACTGCTGAATTAGTATACACTGGCTACC^^ 

CATAACTAGAACA LVm A C Cni n\J lC(aCTTAATCATATCTCACCCATC^ 1 A CA CUI 1 U VCCCATATAAAATGTAGATTCTCTAAATGCAAG 

31600 

J, - AAAGAATAACCTCATAACTACCTA CllCltTl l lA ATACTCATAAGGCCAC^ 

4^ mci ' i ' Ai ' i ccitfgATiaATxxawautfwauuu^TT^ nvci xicbuicra-iA ACTCTrA 

3170O 

CC*TAGGaiTOOGTCCGC»TCTCTroA tTit:riU^ ^ 

31B0O 

CrCTTAACTATCACATTACACOBAOQAACTCACCCATAAACCGACAOG A AC A ' i I lt.1 nU AAGAAATTAAAACAfflCTTOOGAOAOCTGCAATITGACrrT 
(aGAATTCATACTCTAATOCGCTCCTTGA GitAAJiW^ ^ 

55 31900 
GAGTCAACXXX:ACATTCTACA tf iX* "J' iHX AATTCATSACAGA r t VA ' l V lTtKi ' l U AGTCTACA i ' i ' m ' l VlVl ' A AATGCXSTGTGT XJ i'GI'G lG TGTGTCTGTGT 
CTCACTTGGGGTCn'AAGATCmJ^CAACGTrAACTACTCriXrrAAGT^^ 

^^^^^ 

OU (j ' tmtjmii:^ iV iVlVlUlt.lUlO I XXATTa^TTCACCAg lU - lXXXn^lXSG AACTTAGAGCTTAAACTTSA Ult. ICL H tC i tA CATATCACCAACCTACC 

CACg^CACACACACACACftC3^CACAOGTM^CT3U«nqgroAC^^ 

32100 

tTnVitAi ' l 'C J tfWCTATCrCATrOCTTtKaCCTGCTItfffJ^^ ri ' l IV l' CCA CAC Ai ' lCitl ' l lU^ ACATTAAACACaAAGAOA C 

CAAACACCatfaCCPlTACACTAAOQAACCTCGACGATCATCGAAClACCCTCACT^ ^ I IVIV I M 

33J00 

ACA(»CATAGTA 0O3 TgrCACfcGCCATC0CrAaatf3Trai llU14VZTiUXA TtaAT ^ 1 lt:itX r illX;iTiU ^SCTOUM' lXJ ^CTTrCGCTCC»CTCAC 
ICT Cf tt T A TCATOOCACA tt T C lO GC t A COGATOCTCAACTAACAACA?^ ^ 

70 4 > 

>P1 <5*ax«A 

/3 ATOSCCAAG0CAAA C ' l\;i ' iLa.l 1 ItCI iG CAAGATCaUU«auWXacCCATGATTCSACCCACrGGACAGTC(^^ 

TACOSCT l ' U.Ui I H. AGAACGqAAAGGiUkOeiTCTA Li I lVrilt.tlt>i Cn'ACTAACtCG(nGACCTCTCAGGTaxnt^ 

MIBPXiDSPASOSDPLOY> 
KUSTARZAN > 

80 * ' 

>7cai 
I 

33400 

O- CCCAA&nT^CTC TC OGAAACTCCaiCOCACaAOCAA A T Ll t A r ilA AOATCC^ 
GGCnTCACGAGACCXriTTGA OG T GOC T C CTCqTTrAGK^ 

»£AI.GNCTDEQISrKNQYLPVI YSl IFbVaPPa> 
^ HUSTARZAW „^ > 



50 



' BXON 4_ 



90 cRl 

32500 

AACACAOTGCCCATCTCCATCTAC Alll ' ltJ UU»TCOOGCCCnTOABGCC^ 

llTVAX8IYXPXMRPWKOSTVZNI,|ILALT0LI.yXi» 
WUSTAHZA M > 

w on 4 > 

33600 

X PPA(M^'TCfXjB^TPCTCATCCA^nvCT^^^^^^^^^^?^?PSAAAACTG(SA'lT^r'f MTlfMVOA^^TCAlXiTCK^A^ V I 'Ci^CfT 

garOGACCWCAACGACTAOCTAATOTACCGTCAOa^CTTrrGAOCTAOAAACCTO 

TSLPPX.tBYYASOENHIFCDFKCKPXRPGPHPirL> 
WUSTAKZA H > 

105 * 

33700 

CTACACCACC Arii: 'i V l i - CCrC A C e m Cl l CA GTCl C 'i iCO fTI A O ^l i UiO ATCATTCACCCCATqAC ClX>CU I ICl A TrCA GAAAA CTClAn OGG CA 

(aiVi t UlC G TAACAGAACCAgrKa^CSAACTCAGAGAAGGCAATOCAACACTACTAACT^^ 
...^ YSS ILPLTCFSLPRYVVI ZHPHSCFSXOKTRWA> 
1 1 U . ^MUSTM«AII *■ 

BXOH 4 ^ > 



99 



328 00 

GTCCTACCTTCaXK:CGC(KntnXWGTCArrrCTTTGCTJ^^ 
CA(XATCX»ACACGCCCCCACACCC»CTAJU^GAMCCATCCACACTACGG^ 

VVACAGVWVXSLVAVHPHTPLITSTTRTNRSAC* 
MUSTARZAK > 



EXOX 4 



32900 

TTWCTCACCA0TrC3W»TCACCTCACTACrATCIUMnW^^ 

LDLTSSDDX*TTIKIITMI.1LTATTPCLPLVIVTLC> 

. W ISTARZAII > 

E XOW 4 > 

3300 0 

CTACACtyUZAATTATCAGTACCCroACTCACCGCCCTC^ 
G A l V lX ^ Cnvri A ATASICATOCGACTGAgr G CCCG a USCCTCGGT^^ 

VTTZXSTLT80PRTHSCPKQKARRLTZLLLLVP> 

^HUSTARZAM > 



>hekP 

I 33100 
TATATATGTOCTTACCCTTCCACATCrnawSCOTCATTCOCATCGAATCTCCCCT 
ATATATACAAA(UU^TGGGAAGaTOTAGAACTC»CAGTAAGCCrAamAGAGCC^ 
YICPLPrHII.RVXRI8SRLLSISCSIB8RIHBA> 

, MUSTARZAW » 



BXOH 4 



33200 

TGTAACAAAGATCTGGTAATCGACGAGAGTTCnaGAAACCaTTCGACGACAATATA^ 

YIV5RPLAALHTPGHLLLyVVVSNHPQQAPCSIV> 

MUSTARZA H > 

B XOW 4 > 

>3*arm 

>3*axio? <hetR 

I I 33300 

GAGATGCAAAGCCACnXXSGACCTTSAACAAGGJUUVGAAAGACACTTGCTa^^ 11 ICGTCA AGGCCAGAOACATCGCTC 



33400 

AGTOGTAAA eCT CTTG A CATCCAAATCTGAOGGTCTOAGATAGAClCTTCAG 

33500 

It. IGlU;f U J«aiTA<aTATGGA»TAGAATCCrrAAACCCTCGAAGC»rAA It! I IC CAAC 

AGACACCAC r CT A TC T ATACCTCtATCTTACGAATITCCSACCTTOOSATTGAOCOC^^ 

33600 

GACATAAAAGGGAACCSACCAATCCTTAAGCCTtrrCCTTTC»GCTCtACAT^^ 
CltTA ' i ' rnU C C i l CC ' IU;iTA CaAAlTC CG ACAGGJUU^CTOmU»TGTACAA(ngMUlCOT 

33700 

ACA(a.TACATACTTATCAAATATCA 01i:riiUJ iTCAATCAATC3UlTCAATaa 
TgrCT A TPr A T G AATAGTTTATAGTCAQAAAGGTAGTTAGTTASTrAS^ 

33B00 

ATCCroAATACTXTCTCACKnAOCCCAa«nW^ACTC»GTO 
TAGGACITATGGGAGACnCACCrCQGGTGTCATTGAltU^CTCATAAG 

33900 

Altfri IX,1 lU lTATCTITGOCTAGAfflUlTOGGCTACAOGACTATCAATAAACTO^^^ 

TACAAACAACTATAGAAACCCATCTCTTACOCCAlUI'C^iCATACT^^ 1 itlAAlCmWiACCTTiaCTCrCATAAAGACAAaCT 

34O0O 

COUlCTaUCTTGCCTCCACACACAAACT A Cl tgJ O G r A C ^ ^ 

34100 

OriCUi 1 1 iU^ ' riVlUlVlCr r iA TACACCGACirCG'IXJrACATGGCGAAACACGUI U ;i{X'i l l ACTTATAAATCACTCCr A TTCUUXKIACCTTGACT 

34200 

CATCCATGGAATCTGaaWGATCAAAGCAAACn'ATTCCAACCrWnm^^^ 

CTACCTACCrrAGAOC LrAVrAGA ' i ICGl ' l IU lTAAO0TTG(aCCAAGGACACTAGTACCTTTTTCAG A CACa»GACrCTAAC^^ 

34300 

GAACTA L ' l 

34400 

ACAGCACACTCCCACACTCCAQCCT3UfcC7ATCACATACCmACAAfiAAfiGAAAAA ffl U8C A A^ 

TO T C CTC T CAGOOTCTa^GCTOSOATTCATACTCTATOaAA AiGl IVl ' lVL ' l 1 1 1 it ACtllCAA CCTAACAACATACACCOSATCIW UL ' ICItUG ' W 

34500 

TTTTCCCTrCTGAGGCCCTTCCTCGOCTAGCCrCACCCriCCT^^ 

AAAACOGAAGACTCCGGGAACGACCO^ATCCGAGTGGGAACGACGAACACATACU^^ I'l'l A riXJU:CGACAAGACTCAATCAGGCOGTCTOCCAAGTCT 

34600 

CAQACCTGCTAGTCCaiTnCTAACTOG Al i iAi i 1 1 1 1 A CTATA7GGGM« a*i 11 IblU CC i lA CATTACTAAATTATCATACACOG GAAAAAAG AGTG 
GTCIGCACGATCAOCTAAACATTGACCTAAATIUUIOAATCATATA 

3*''° ° 

ju£iAi£SroSTCAAAi^^ 

3460 0 

CAACTC A IT A T L TA lACAA m ' nTACriX.t, 1 1 AAI iCGll.iG A b ' 1 ' l ' im.l ' l itA GCTCTQGCTA ULlL A 1 1 A aACTCAAg OAAT ACA'ITAtil I Cl'I'i'AC 
CTTOACmTOCAAATCTTCIUUUUUMU^fiCAAATAACCACACICAAAAOSAAACTC^^ 

34900 

ATGGCCCACTOASCAA Mn» CI U: AATACAAGCATta W^Ci AU^UiLll^ 
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A U ' lti ia ' IUt.lU »C^OCTTOCCCO<TO^ UlW I llA CXa^ imiACI HX b ftAOCCAGACCXCGAAT 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 
90 
95 



TCCAAAGACATGCIUnTOGAAOCCaUXrtCTG^^ 
A CVmVIUTA CGTaWXritiG< ^ lC U WCCCCAC C ^^^^ 



35000 

TAGACTACTTAGTGAGCICTA 
ATCTCATGAATCACTCGKAT 



35100 

ACAaVAATACAAACCCTCACrCCAAAACTt»CXy^CCACAGCAAGCTGC«^ 

TtnxnTrATCrrKWGACTCACCTrTTGA C i U ^ ri 'G G TCT LGI I C CACCCCGAGACCGGACrCAAAACTCGCAACATCCC(nXKMATTGTTAAC<»ATO 

35200 

TTATAAATCATArrAACrmArnwrAATrGTOUtfaTrrcci^ 

AATATTTAffTWAATTCAAAATAACCATnaCAinTCTA^^ 

35300 

CnaCTCAAATCrAAC Al iU ill lU iTTtTCiaGQSkTMGAmeCkaGtM^^ UlAATCCTTTAGTACTATAATO^AAA 
(»TTCAGTrntf»TTCTAACAAAACTAAAIUrrACCCTATO 

- 35400 

TAATAACTCTACAAXTTOCTAAA U ' mnCl IGG CCT AiUllUlUJi i ' A ATTA^ 

ATTATTq^CATCTTPOiCGATTTCAATAAQAAOCOCaTlCAACAC C AATrAATC^ i iACOrftCC 

35500 

TCTAGATCrrrrAAACCTTAAA(yj^TCTOrrC»TATCACTAATaAAOT 
AGATCTAGAAAATTTCGA AT TTC TT ACACAACTATACTGATTArrr^ 

3S600 

ATTCATOCTCTATTOAAAACAATGGTTCGAGACCAAAAGGAAGCTTG AAA^ 
TAAGTAATCACATAA LTITltyn ' A CCAAG CICi 'CCTTTTCC rX CGAA CXTri ^ ^ ^ 

35700 

GCOAT»CTCt<SKEkCPICiaimarrG^^ lA AOOCTTTAOtaTCTGAOGTAAATCSCaTACGGOOGACaGTaZACGA 

3SB00 

TCOCCA Cia;! 1 IC l CCGAtOUl WG JtfTOCrACa L^^nTm A CCirCCATlCrACA ^ l ILA CGTOTCrCCTAAACCTTCTACOCCCCA 

35900 

CCACGGACCrCCOtTTGAGGAATntyvCytf^CCTCGGGCXr^^ 
GGrrCCCTCCaiaWSCACTCCTTAAACTCrrCGGACCCC^^ 

ATTCT0GA C TCClCC G AC a .TGGTO3C0CTACCTTTTACAC^^ 

36100 

lt.rritltJ tf30GAGA CT rAGATCC>fiCTCTCTA<UTa^^ 
ACAAACACTCCC' r Cl CA Al C r A CGTCtMtfaiCATCT3 U;il C U;iCA GA<^^ 

36200 

GTTTACXGATTATroTOCTOGJUUUkTOBCTGCATACCACCTCCTCA^^ 
CMUnGACTAATAAGTW^ACCrrrrXCCCAarTATCWriOCl^^ 

36300 

ACTrACTCGCTAACTCTOGGG G TCTCT A CATCATTItfpIOTCatfi^^ 
TGAATGACCGATTGACAGCaxaOAGATCTAOTAATCATI^CCTCTTG^ 



TACACTCTCTTCCAACAO 



C CCCACCAAAGTACAGOGAT A TAAACAT 
GGQGTCSGTTTCATGTOOCTATAT 



ACAC3UXra^aVCACA(»0(nCTATTAGGAAAATCriTA( 
ATAATCCTTTTACAAAT 

36500 

a^TCTKXZACAGATnXXnXSACTCT AXVmtiGa. ^ ^ ^ ^ 
ATAfiACCGTtnCrAACOGACTCACATACAAAOCGGAATAAAACAATAAAGPrCCCT^ 



TCCAAa^CrrJU^TOOCA C ' nXX T CC AATlTITAAC Ai ' liTiAlUA CgC^^ 

36700 

CA a. ' lXJUL ' X ' IU ATTCAg L ' i ' l i I iOlCKC ' ltliU ' l I AUJ ' J W CCrCrA l CCC O CO O Q CaCA A OCAAACAC TtaTCACAOT CTOOCC^^ 
GTCGAGOCAACTAACnTOUUUUlCACroAGACAATOCatfTGGACATAUA;^ 

36800 

TTGAGCATAATCACAAAGGTCCAAGCGAAATTGAATTCAAAAAU-riU;riTIAi 1 1 iCCAAAG'f A'ltCUJ'i It IGG ACTAATTGGGCTAOAGGTTGCATP 
AACTCCTATTAGTCTTTCCACGTTCGCTrrAACTTAACTT^^ 

36900 

TTGAACTGAATCCCCTCCCCTCCAAAACGACAGATAATt»TACC(a3CTACACTACG^ 

<3'aziiA O'prP 

I I 37000 

lATCAAGCTUrrtnglATrACAAQSAAACTCKrCATGCATGC^^ 

3>>TAA H;ntt.l ' i lb ACCGTACCTACGA<m XA ;r A CrCCCCTCACCCCACCC7roAgITC^ 

37100 

ATACAATAACTCCCCCATirnUTTCTATGAGTGCCTCACAGAAATGTTCAA 
ATATGrrATTGAOSGanAAAACIAACATACTCACGGAGTCnCIT^^ 



CACCT CAGAAAATCCAGAATAAAGAAAC TCTGAAT 
<nYiGACTCTTTTAGaTCXTATTTCTTTQACACTTAI 



<3*prR 



AACCACACnTAAACAAAATCAC AAAA CCGAWltXrr A CCTTAAACA TWT g rCA C ^^ 
TIOGTCTClUlTTTCrTTn^GTCrrnQCC^ 



3T200 
ATCTGTCTC 
ATAGAOCaO 
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Each of the applications and patents mentioned in this document, and each 
document cited or referenced in each of the above applications and patents, including 
during the prosecution of each of the applications and patents ("application cited 
documents") and any manufacturer's instructions or catalogues for any products cited or 
5 mentioned in each of the applications and patents and in any of the application cited 

documents, are hereby incorporated herein by reference. Furthermore, all documents cited 
in this text, and all documents cited or referenced in documents cited in this text, and any 
manufacturer's instructions or catalogues for any products cited or mentioned in this text, 
are hereby incorporated herein by reference. 



1 0 Various modifications and variations of the described methods and system of the 

invention will be apparent to those skilled in the art without departing from the scope and 
spirit of the invention. Although the invention has been described in connection with 
specific preferred embodiments, it should be understood that the invention as claimed 
should not be unduly limited to such specific embodiments. Indeed, various modifications 

15 of the described modes for carrying out the invention which are obvious to those skilled in 
molecular biology or related fields are intended to be within the scope of the claims. 
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Claims 

1 . A Tarzan GPCR polypeptide comprising the amino acid sequence shown in SEQ 
ID NO. 3 or SEQ ID NO: 5, or a homologue, variant or derivative thereof. 

2. A nucleic acid encoding a polypeptide according to Claim 1 . 

S3. A nucleic acid according to Claim 2, comprising the nucleic acid sequence shown 
in SEQ ID No. 1, SEQ ID No.2 or SEQ ID NO: 4, or a homologue, variant or derivative 
thereof. 

4. A polypeptide comprising a fragment of a polypeptide according to Claim 1 . 

5. A polypeptide according to Claim 3 which comprises one or more regions which 
10 are homologous between SEQ ID No. 3 and SEQ ID No. 5, or which comprises one or 

more regions which are heterologous between SEQ ID No. 3 and SEQ ID No. 5. 

6. A nucleic acid encoding a polypeptide according to Claim 4 or S. 

7. A vector comprising a nucleic acid according to Claim 2, 3, or 6. 

8. A host cell comprising a nucleic acid according to Claim 2, 3, or 6, or vector 
1 5 according to Claim 7. 

9. A transgenic non-hiunan animal comprising a nucleic acid according to Claim 2» 3 
or 6, or a vector according to Claim 7. 

10. A transgenic non-human animal according to Claim 9 which is a mouse. 
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1 1 . Use of a polypeptide according to Claim 1, 4 or 5 in a method of identifying a 
compound which is capable of interacting specifically with a G protein coupled receptor. 

12. Use of a transgenic non-human animal according to Claim 9 or 10 in a method of 
identifying a compound which is capable of interacting specifically with a G protein 

5 coupled receptor. 

13. A method for identifying an antagonist of a Tarzan GPCR» the method comprising 
contacting a cell ^ich expresses Tarzan receptor with a candidate compound and 
determining whether the level of cyclic AMP (cAMP) in the cell is lowered as a result of 
said contacting. 



10 14. A method for identifying a compound capable of lowering the endogenous level of 
cyclic AMP in a cell which method comprises contacting a cell which expresses a Tarzan 
GPCR with a candidate compound and determining whether the level of cyclic AMP 
(cAMP) in the cell is lowered as a result of said contacting. 

1 5. A method of identifying a compound capable of binding to a Tarzan GPCR 
1 5 polypeptide, the method comprising contacting a Tarzan GPCR polypeptide with a 

candidate compoimd and determining whether the candidate compound binds to the 
Tarzan GPCR polypeptide. 

16. A compound identified by a method according to any of Claims 1 1 to 1 5. 

17. A compound capable of binding specifically to a polypeptide according to Claim 1 , 
20 4 or 5. 

1 8. Use of a polypeptide according to Claim 1 , 4 or 5, or part thereof or a nucleic acid 
according to Claim 2, 3 or 6» in a method for producing antibodies. 
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1 9. An antibody capable of binding specifically to a polypeptide according to Claim 1 , 
4 or 5, or part thereof or a polypeptide encoded by a nucleotide according to Claim 2, 3 or 
6, or part thereof. 

20. A pharmaceutical composition comprising any one or more of the following: a 

5 polypeptide according to Claim 1, 4 or 5, or part thereof; a nucleic acid according to Claim 
2, 3 or 6, or part thereof; a vector according to Claim 7; a cell according to Claim 8; a 
compound according to Claim 16 or 17; and an antibody according to Claim 19, together 
with a pharmaceutically acceptable carrier or diluent. 

21. A vaccine composition comprising any one or more of the following: a polypeptide 
1 0 according to Claim 1 , 4 or 5, or part thereof; a nucleic acid according to Claim 2, 3 or 6, or 

part thereof; a vector according to Claim 7; a cell according to Claim 8; a compound 
according to Claim 16 or 17; and an antibody according to Claim 19. 

22. A diagnostic kit for a disease or susceptibility to a disease comprising any one or 
more of the following: a polypeptide according to Claim 1 , 4 or 5, or part thereof; a 

1 5 nucleic acid according to Claim 2, 3 or 6, or part thereof; a vector according to Claim 7; a 
cell according to Claim 8; a compound according to Claim 16 or 17; and an antibody 
according to Claim 19. 

23. A method of treating a patient suffering from a disease associated with enhanced 
activity of a Tarzan GPCR, which method comprises administering to the patient an 

20 antagonist of Tarzan GPCR. 

24. A method of treating a patient suffering from a disease associated with reduced 
activity of a Tarzan GPCR, which method comprises administering to the patient an 
agonist of Tarzan GPCR. 

25. A method according to Claim 23 or 24, in which the Tarzan GPCR comprises a 
25 polypeptide having the sequence shown in SEQ ID NO: 3 or SEQ ID NO: 5. 
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26. A method for treating and/or preventing a disease in a patient, which comprises the 
step of administering any one or more of the following to the patient: a polypeptide 
according to Claim 1, 4 or 5, or part thereof; a nucleic acid according to Claim 2, 3 or 6, or 
part thereof; a vector according to Claim 7; a cell according to Claim 8; a compound 

'5 according to Claim 16 or 17; an antibody according to Claim 19; a pharmaceutical 
composition according to Claim 20; and a vaccine according to Claim 20. 

27. An agent comprising a polypeptide according to Claim 1 , 4 or 5, or part thereof; a 
nucleic acid according to Claim 2, 3 or 6» or part thereof; a vector according to Claim 7; a 
cell according to Claim 8; a compound according to Claim 16 or 17; and/or an antibody 

1 0 according to Claim 19, said agent for use in a method of treatment or prophylaxis of 
disease. 

28. Use of a polypeptide according to Claim 1, 4 or 5, or part thereof; a nucleic acid 
according to Claim 2, 3 or 6, or part thereof; a vector according to Claim 7; a cell 
according to Claim 8; a compound according to Claim 16 or 17; and an antibody 

15 according to Claim 19, for the preparation of a pharmaceutical composition for the 
treatment or prophylaxis of a disease. 

29. A non-human transgenic animal, characterised in that the transgenic animal 
comprises an altered Tarzan gene. 

30. A non-human transgenic animal according to Claim 29, in which the alteration is 
20 selected from the group consisting of: a deletion of Tarzan, a mutation in Tarzan resulting 

in loss of function, introduction of an exogenous gene having a nucleotide sequence with 
targeted or random mutations into Tarzan, introduction of an exogenous gene from another 
species into Tarzan, and a combination of any of these. 



25 



31. A non-human transgenic animal having a functionally disrupted endogenous 
Tarzan gene, in which the transgenic animal comprises in its genome and expresses a 
transgene encoding a heterologous Tarzan protein. 
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32. A nucleic acid construct for functionally disrupting a Tarzan gene in a host cell, the 
nucleic acid construct comprising: (a) a non-homologous replacement portion; (b) a first 
homology region located upstream of the non-homologous replacement portion, the first 
homology region having a nucleotide sequence with substantial identity to a first Tarzan 
5 gene sequence; and (c) a second homology region located downstream of the non- 
homologous replacement portion, the second homology region having a nucleotide 
sequence with substantial identity to a second Tarzan gene sequence, the second Tarzan 
gene sequence having a location downstream of the first Tarzan gene sequence in a 
naturally occurring endogenous Tarzan gene. 

10 33 . A process for producing a Tarzan GPCR polypeptide, the method comprising 
cultiuing a host cell according to Claim 8 under conditions in which a nucleic acid 
encoding a Tarzan GPCR polypeptide is expressed. 

34. A method of detecting the presence of a nucleic acid according to Claim 2, 3 or 6 
in a sample, the method comprising contacting the sample with at least one nucleic acid 

1 5 probe which is specific for said nucleic acid and monitoring said sample for the presence 
of the nucleic acid. 

35. A method of detecting the presence of a polypeptide according to Claim 1 » 4 or S 
in a sample, the method comprising contacting the sample with an antibody according to 
Claim 1 9 and monitoring said sample for the presence of the polypeptide. 

20 36. A method of diagnosis of a disease or syndrome caused by or associated with 
increased, decreased or otherwise abnormal expression of Tarzan GPCR, the method 
comprising the steps of (a) detecting the level or pattem of expression of Tarzan GPCR in 
an animal suffering or suspected to be suffering from such a disease; and (b) comparing 
the level or pattem of expression with that of a normal animal. 

25 37. A nucleic acid sequence selected firom group consisting of musTarzan S'prF 
(TTTGATGGAGTAGATTCAGTGTCAGAC), musTarzan 5*prR 
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(GTCCCAGAAGAGCTTGGGAATCCAAAC), musTarzan 5'annF.2 Age 
(tttaccgGTTCCTACATTAGCGACTTTTCTGGAC), musTarzan 5'armRNot 
(aaagcggccgCCAGGAAATCCGAATCACTGGCTGGAC), musTarzan 3'annF Asc 
(aaaggcgcgccGATGCAAAGCCAGTGGGGACCTTGAAC), musTarzan 3'armR Fse 
5 (tttggccGGcCTGAGCATGCATGGCAGTTTCCTTG), musTarzan 5'scr.2 
(AATCATCCTTCTTGCTGGGGTTCAGAC), musTarzan 3'prF 
(CGTGGCACTCAAGTCTGGATGCACCTG), musTarzan 3'prR 
(AGACACTAGGACATTAATCCCTGTGAC), musTarzan hetF 
(TCCACATCTTGAGGGTCATTCGGATCG), musTarzan hetR 
10 (TTCTTTCCTTGTTCAAGGTCCCCACTG), musTarzan Fl 
(GGATTTCCTGGATTACCCAAGTGC), musTarzan Rl 
(CAGATACAGCAAGTCCGTCAAGGC), Asc306 
(AATGGCCGCTTTTCTGGATTCATCGAC) and DRl 
(CATGCCGCCTGCGCCCTATTGATCATG). 

15 38. A nucleic acid sequence, including a Tarzan nucleic acid sequence, a polypeptide 
sequence, including a Tarzan polypeptide sequence, a method of treatment, a method of 
diagnosis, a host cell, vector, transgenic animal, assay, diagnostic kit, vaccine, 
phamaceutical composition or agent substantially as hereinbefore described with reference 
to and as shown in the accompanying dravnngs. 
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